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The invention is directed towards the isolation, characterization and use of human organic anion transporter polypeptides for phar- 
macological screening of substrates, analogues of substrates, and modulators of human organic anion proteins. This invention also 
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ORGANIC ANION TRA NSPORTER REIVES AND PROTEINS 

Related Applications 

This application claims the benefit of U.S. Serial No. 60/143,771, entitled 
5 "Organic Anion Transporter Genes and Proteins," which was filed July 1 2, 1 999 and 
which is incorporated by reference herein in its entirety, including figures. 

Field of the Invention 

The present invention relates to specific proteins as well as recombinant versions 
10 of these proteins which are organic anion transporters. These proteins include human 
membrane proteins preferentially found in kidney and liver cells. The present invention 
also relates to nucleotide sequences encoding these novel organic anion transporters. In 
another aspect, the present invention relates to methods for using these proteins as in vitro 
screening agents to identify substrates for the proteins and inhibitors which block transport 
1 5 activity. In yet another aspect, these proteins may be used in in vitro assays to predict 
drug pharmacokinetics, drug distribution and drug toxicity. The invention also discloses 
antibodies that specifically bind to these proteins and which may be used in diagnostic 
assays. 



25 



20 Background of the Invention 

The following describes certain relevant art, none of which is admitted to be prior 
art to the inventions described herein. 

The liver and kidney are two organs that can extract a variety of organic anions 
from circulation, including endogenous compounds such as bile acids and bilirubin and 
xenobiotics such as sulfobromophthalein (BSP) and >-aminohippurate (PAH). The liver 
and kidney thus have the critical functions of bile secretion, detoxification, and drug 
elimination. Since the cell membrane represents a hydrophobic barrier that prevents influx 
of charged or hydrophilic molecules, hepatocytes and kidney epithelial cells express 
proteins on their basolateral membranes to facilitate transport of organic anions into the 
cell. The liver and kidney contain membrane proteins that transport very specific 
compounds such as taurocholic acid or prostaglandins. However, they also have proteins 
that exhibit a wider substrate specificity. Recently, transporters of the latter type have 
been cloned from rat liver and kidney. One such protein is called the organic anion 
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transporting polypeptide, oatp (Jacquemin et al. (1994) PNAS 91:133-37). This protein 
belongs to a family of related transporters, members of which consists of oatpl, oatp2, 
oatp3, and prostaglandin transporters. These proteins share homology at the amino acid 
level (Noe et al. (1997) PNAS 94: 10346-50; Kanai et al. (1995) Science 268: 866-69; Abe 
5 et al. (1 998) J. Biol. Chem. 273: 22395-401). When expressed in Xenopus oocytes or in 
mammalian cells, members of the oatp family were shown to transport many types of 
organic anions including taurocholic acid, BSP, and conjugated steroid hormones. The 
related prostaglandin transporters show high affinity transport of prostaglandins. 

Shortly after the cloning of oatpl, members of another organic anion transporter 

10 family, OAT1 and OAT2, were isolated from rat kidney and liver, respectively (Sekine et 
al. (1997) J. Biol Chem 272: 18526-29; Sekine et al. (1998) FEBS Letters 429: 179-82). 
Rat OAT1 and OAT2 are homologous to each other but show no significant homology to 
members of the oatp family. However, they show some homology to the family of organic 
cation transporters (OCTs), suggesting that these two families of proteins share a common 

15 origin. Rat OAT1 and OAT2 can transport many different organic anions. OAT1, when 
expressed in oocytes, can transport /?-aminohippurate (PAH), methotrexate, and glutarate. 
OAT1 is most likely the molecular entity responsible for the classical /?-aminohippurate/<x- 
ketoglutarate exchanger found in the kidney proximal tubule. OAT2, expressed 
predominantly in the liver, has been shown to transport organic anions such as salicylate, 

20 methotrexate, and a-ketoglutarate. Therefore, OAT1 and OAT2 belong to yet another 
family of multi-specific organic anion transporters distinct from the oatp family. 



Summary of the Invention 

The present invention is directed to a group of polypeptides, preferentially 
25 expressed in the liver and kidneys of humans and which have activity as organic anion 
transporters. We refer to these polypeptides as 4 'human organic anion transporters" or 
"hOAT polypeptides". These proteins and their properties are described in detail below. 

The invention also concerns, nucleic acids encoding hOAT polypeptides, cells, 
tissues and animals containing such nucleic acids, antibodies to the polypeptides, assays 
30 utilizing the polypeptides, and methods relating to all of the foregoing. 

A first aspect of the invention features an isolated, enriched, or purified nucleic 
acid molecule encoding an hOAT polypeptide or encoding a fragment of an hOAT 
polypeptide. 

2 



1NSDOCID: <WQ 01042B3A2J _> 

i 



WO 01/04283 PCT/US00/I8980 

In preferred embodiments the isolated nucleic acid comprises a nucleic acid 
sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, SEQ ID NO:6, or a functional derivative thereof, a nucleic acid sequence that 
hybridizes to the nucleic acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID 
5 NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a functional derivative thereof. 

The nucleic acid may be isolated from a natural source by cDNA cloning or 
subtractive hybridization; the natural source may be mammalian (human) blood, semen, or 
tissue and the nucleic acid may be synthesized by the triester or other method or by using 
an automated DNA synthesizer. Preferably, the nucleic acid is isolated from mammalian 
1 0 kidney or brain, most preferably from liver. ♦ 

Individual clones isolated from a cDNA library may be purified to electrophoretic 
homogeneity. The claimed DNA molecules obtained from these clones can be obtained 
directly from total DNA or from total RNA. The cDNA clones are not naturally 
occurring, but rather are preferably obtained via manipulation of a partially purified 

15 naturally occurring substance (messenger RNA). The construction of a cDNA library 
from mRNA involves the creation of a synthetic substance (cDNA) and pure individual 
cDNA clones can be isolated from the synthetic library by clonal selection of the cells 
carrying the cDNA library. Thus, the process which includes the construction of a cDNA 
library from mRNA and isolation of distinct cDNA clones yields an approximately 10 6 - 

20 fold purification of the native message. Thus, purification of at least one order of 

magnitude, preferably two or three orders, and more preferably four or five orders of 
magnitude is expressly contemplated. 

In another preferred embodiment, the nucleic acid molecules of the invention 
comprise nucleotide sequences that (a) have the nucleic acid sequences set forth in SEQ 

25 ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a 
functional derivative thereof, or encode polypeptides having the full-length amino acid 
sequences set forth in SEQ ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO: 10, SEQ 
ID NO:l 1, SEQ ID NO: 12, or a functional derivative thereof; (b) is the complement of the 
nucleotide sequence of (a); (c) hybridizes under highly stringent conditions to the 

30 nucleotide molecules of (a) and encodes a naturally occurring hOAT polypeptide; (d) 

encodes an hOAT polypeptide having the full-length amino acid sequence of the sequence 
set forth in SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, 
or SEQ ID NO: 12 except that it lacks one or more of the domains selected from the group 
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consisting of an extracellular domain, a transmembrane domain, and an intracellular 
domain; or (e) is the complement of the nucleotide sequence of (d). 

The nucleic acid molecules of the invention are isolated, enriched, or purified 
preferably from a mammal, more preferably from a human. 
5 Hybridization techniques may be used to isolate a nucleic acid molecule of interest 

Various low or high stringency hybridization conditions maybe used depending upon the 
specificity and selectivity desired. Stringency is controlled by varying salt or denaturant 
concentrations. Highly stringent conditions may mean conditions that are at least as 
stringent as the following: hybridization in 50% formamide, 5x SSC, 50 mM NaH;jP04, 

10 pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, arid 5x Denhart solution at 
42 °C overnight; washing with 2x SSC, 0.1% SDS at 45 °C; and washing with 0.2x SSC, 
0.1% SDS at 45 °C. Those skilled in the art will recognize how such conditions can be 
varied to vary specificity and selectivity. 

Particularly preferred embodiments of this aspect of the invention are naturally or 

15 non-naturally occurring variants of the nucleic acids of the invention. Among variants in 
this regard are variants that differ from the aforementioned nucleic acid molecules by 
nucleotide substitutions, deletions or additions. The substitutions, deletions or additions 
may involve one or more nucleotides. The variants may be altered in coding or non- 
coding regions or both. Alterations in the coding regions may produce conservative or 

20 non-conservative amino acid substitutions, deletions or additions. Such nucleic acid 

molecules are identifiable as being able to hybridize to or which are at least about 60-65% 
percent identical, preferably at least about 70-75% percent identical, more preferably at 
least about 80-83% percent identical, and even more preferably at least about 87-95% 
percent identical to the nucleic acid molecules shown in SEQ ID NO:l, SEQ ID NO:2, 

25 SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. 

Among the particularly preferred embodiments of the invention in this regard are 
nucleic acid molecules encoding polypeptides having the amino acid sequences set forth in 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:ll, SEQ ID 
NO:12; variants, analogs, derivatives and fragments thereof; and fragments of the variants, 

30 analogs and derivatives. 

Further particularly preferred in this regard are nucleic acid molecules encoding 
OAT variants, analogs, derivatives and fragments, and variants, analogs and derivatives of 
the fragments, which have the amino acid sequence for the hOAT polypeptides of SEQ ID 
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NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12 in 
which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, 
deleted or added, in any combination. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and activities of the 
5 OAT polypeptides. Also especially preferred in this regard are conservative substitutions. 
In one embodiment, the polypeptide encoded by the nucleic acid molecule is at least about 
30-35%, preferably at least about 40-45%, more preferably at least about 50-55%, even 
more preferably at least about 60-65%, yet more preferably at least about 70-75%, still 
more preferably at least about 80-85%, and most preferably at least about 90-95% or more 

10 identical to the amino acid sequence of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO: 10, SEQ ID NO:l 1, or SEQ ID NO:12. 

In still another embodiment, the nucleic acid molecule encodes a naturally 
occurring variant of the polypeptide of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:ll, or SEQ ID NO:12 and hybridizes under stringent conditions to 

15 a nucleic acid molecule comprising SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5 or SEQ ID NO:6, respectively. 

Further preferred embodiments are variaSFnts of the herein described nucleic acid 
molecules in which the transporting activity of the polypeptides is altered. Such 
alterations can result in either increased, or decreased transport of substrates recognized by 

20 hOAT polypeptides. Further, variants can be made that result in OAT polypeptides which 
are no longer responsive to cellular signals, for example, protein kinase C deactivation of 
transport. 

Especially preferred are nucleic acid molecules encoding OATs which are 
expressed on the basolateral (sinusoid) membrane of hepatocyte cells. 

25 Included in the invention are all possible nucleic acid sequences that encode the 

amino acid sequences of the hOAT polypeptides of the invention. As it is recognized that 
alternate codons will encode the same amino acid for most amino acids due to the 
degeneracy of the genetic code, the sequences of this aspect includes nucleic acid 
sequences utilizing such alternate codon usage for one or more codons of a coding 

30 sequence. For example, all four nucleic acid sequences GCT, GCC, GCA, and GCG 

encode the amino acid, alanine. Therefore, if for an amino acid there exists an average of 
three codons, a polypeptide of 100 amino acids in length will, on average, be encoded by 
3 100 , or 5 x 10 47 , nucleic acid sequences. Thus, a nucleic acid sequence can be modified 
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(e.g. 9 a nucleic acid sequence from an hOAT as specified above) to form a second nucleic 
acid sequence encoding the same polypeptide as encoded by the first nucleic acid 
sequence using routine procedures and without undue experimentation. Thus, all possible 
nucleic acid sequences that encode the amino acid sequences of the hOAT polypeptides of 
5 the invention are included in the present invention and are described, as if all were written 
out in full, taking into account the codon usage. 

The alternate codon descriptions are available in common textbooks, for example, 
Stryer, BIOCHEMISTRY 3 rd ed., and Lehninger, BIOCHEMISTRY 3 rd ed. Codon 
preference tables for various types of organisms are available in the literature. Because of 

10 the number of sequence variations involving alternate codon usage, for the sake of brevity, 
individual sequences are not separately listed herein. Instead the alternate sequences are 
described by reference to the natural sequence with replacement of one or more (up to all) 
of the degenerate codons with alternate codons from the alternate codon table, preferably 
with selection according to preferred codon usage for the normal host organism or a host 

15 organism in which a sequence is intended to be expressed. Those skilled in the art also 
understand how to alter the alternate codons to be used for expression in organisms where 
certain codons code differently than shown in the "universal" codon table. 

In a second aspect, the invention relates to a nucleic acid vector comprising a 
nucleic acid molecule encoding an hOAT polypeptide and a promoter element effective to 

20 initiate transcription in a host cell. 

Those skilled in the art would recognize that a nucleic acid vector can contain 
many other nucleic acid elements besides the promoter element and the hOAT nucleic acid 
molecule. These other nucleic acid elements include, but are not limited to, origins of 
replication, ribosomal binding sites, nucleic acid sequences encoding drug resistance 

25 enzymes or amino acid metabolic enzymes, and nucleic acid sequences encoding secretion 
signals, periplasm or peroxisome localization signals, or signals useful for polypeptide 
purification. 

A circular double stranded nucleic acid molecule can be cut and thereby linearized 
upon treatment with restriction enzymes. An assortment of vectors, restriction enzymes, 
30 and the knowledge of the nucleotide sequences that the restriction enzymes operate upon 
are readily available to those skilled in the art. A nucleic acid molecule of the invention 
can be inserted into a vector by cutting the vector with restriction enzymes and ligating the 
two pieces together. 

6 
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Many techniques are available to those skilled in the art to facilitate transformation 
or transfection of the expression construct into a prokaryotic or eukaryotic organism. 
These methods involve a variety of techniques, such as treating the cells with high 
concentrations of salt, an electric field, or detergent, to render the host cell outer 
5 membrane or wall permeable to nucleic acid molecules of interest. 

In a third aspect, the invention features a nucleic acid probe for the detection of a 
nucleic acid encoding an hOAT polypeptide, fragment or analogue in a sample. The 
nucleic acid probe contains nucleotides that will hybridize specifically to a sequence of at 
least 14 contiguous nucleotides set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 

10 SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or a functional derivative thereof. The 
probe is preferably at least 14 or more bases in length and selected to hybridize 
specifically to a unique region of an hOAT encoding nucleic acid. 

In preferred embodiments, the nucleic acid probe hybridizes to at least 14 
nucleotides of a nucleic acid encoding the full-length sequence set forth in SEQ ID NO:7, 

15 SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, or a 
functional derivative thereof. Various low or high stringency hybridization conditions 
may be used depending upon the specificity and selectivity desired. Under highly 
stringent hybridization conditions, only highly complementary nucleic acid sequences 
hybridize. Preferably, such conditions prevent hybridization of nucleic acids having 1 or 2 

20 mismatches out of 20 contiguous nucleotides. 

In another preferred embodiment, the nucleic acid is an isolated conserved or 
unique region, for example, those useful for the design of hybridization probes to facilitate 
identification and cloning of additional polypeptides, or for the design of PCR probes to 
facilitate amplification or cloning of additional polypeptides. Such a nucleic acid may 

25 contain additional sequences in addition to the conserved or unique region, preferably 10- 
20 additional nucleotides on each side of the conserved or unique region, more preferably 
30-40 additional nucleotides, and most preferably 75-100 additional nucleotides. 

Methods for using the probes include detecting the presence or amount of hOAT 
polypeptide RNA in a sample by contacting the sample with a nucleic acid probe under 

30 conditions such that hybridization occurs and detecting the presence or amount of the 
probe bound to hOAT polypeptide RNA. The nucleic acid duplex formed between the 
probe and a nucleic acid sequence coding for an hOAT polypeptide may be used in the 
identification of the sequence of the nucleic acid detected (for example see, Nelson et al., 

7 
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in Nonisotopic DNA Probe Techniques, p. 275 Academic Press, San Diego (Kricka, ed., 
1992) hereby incorporated by reference herein in its entirety, including any drawings). 
Kits for performing such methods may be constructed to include a container having 
disposed therein a nucleic acid probe. 
5 Another feature of the invention is a nucleic acid molecule as set forth in SEQ ID 

NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or 
fragments thereof, comprising one or more regions that encode an hOAT polypeptide or 
an hOAT domain polypeptide, where the hOAT polypeptide or the hOAT domain 
polypeptide is fused to a non-hOAT polypeptide or amino-terminal tag. Possible non- 

10 hOAT polypeptide or amino-terminal tag fusion partners include, for example, but are not 
limited to, glutathione-S-transferase (GST)-fusion proteins, green fluorescent protein 
(GFP) and fusions with histidine residues as described by Janknecht et ah, 1991, Proc. 
Natl Set USA 88:8972-8976. 

The invention also features recombinant nucleic acid, preferably in a cell or an 

15 organism. The recombinant nucleic acid may contain a sequence set forth in SEQ ID 

NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6 or a 
functional derivative thereof and a vector or a promoter effective to initiate transcription in 
a host cell. The recombinant nucleic acid can alternatively contain a transcriptional 
initiation region functional in a cell, a sequence complimentary to an KNA sequence 

20 encoding an hOAT polypeptide and a transcriptional termination region functional in a 
cell. 

Another aspect of the invention relates to a recombinant cell or tissue comprising a 
nucleic acid molecule encoding an hOAT polypeptide or hOAT polypeptide fragment. 
The recombinant cell may comprise a nucleic acid molecule encoding an hOAT 
25 polypeptide; an hOAT domain polypeptide; an hOAT polypeptide fragment or an hOAT 
polypeptide or polypeptide fragment fused to a non-hOAT polypeptide. The recombinant 
cell can harbor a nucleic acid vector that is extragenomic or intragenomic. 

Extragenomic vectors are designed with their own origins of replication allowing 
them to utilize the recombinant cell replication machinery to copy and propagate the 
30 vector nucleic acid sequence. These vectors are small enough that they are not likely to 
harbor nucleic acid sequences homologous to genomic sequences of the recombinant cell. 

Multiple intragenomic vectors are available to those skilled in the art and contain 
nucleic acid sequences that are homologous to nucleic acid sequences in a particular 

8 
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organism's genomic DNA. These homologous sequences will result in recombination 
events that integrate portions of the vector into the genomic DNA. Those skilled in the art 
can control which nucleic acid sequences of the vector are integrated into the cell genome 
by flanking the portion to be incorporated into the genome with homologous sequences in 
5 the vector. 

A new combination of genes or nucleic acid molecules can be introduced into an 
organism using a wide array of nucleic acid manipulation techniques available to those 
skilled in the art 

In yet another aspect, the invention features an isolated, enriched, or purified 

10 hOAT polypeptide. 

It is also advantageous for some purposes that an amino acid sequence be in 
purified form. Purification of at least one order of magnitude, preferably two or three 
orders, and more preferably four or five orders of magnitude is expressly contemplated. 
The substance is preferably free of contamination at a functionally significant level, for 

15 example 90%, 95%, or 99% pure. The hOAT polypeptides, or fragments thereof, of the 
present invention are preferably isolated, purified, or enriched from a mammal or a 
mammalian cell. The mammal is as defined herein and preferably is a mouse, and most 
preferably is a human. These polypeptides may be isolated, purified, or enriched from a 
cell that comprises an endogenous nucleic acid molecule that encodes the polypeptide, or 

20 from a cell that is transformed with a nucleic acid molecule that encodes the polypeptide. 
The polypeptide may also be chemically synthesized. Procedures for obtaining 
polypeptides using the above methods are well known to those skilled in the art 

The polypeptides of the invention comprise an amino acid sequence having (a) the 
full-length amino acid sequence encoded by the nucleic acid sequences set forth in SEQ 

25 ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or 
functional derivatives thereof, or the amino acid sequences set forth in SEQ ID NO:7, 
SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, or 
functional derivatives thereof. 

The invention also features an isolated, enriched, or purified hOAT polypeptide 

30 fragment 

Examples of polypeptide fragments include hOAT domains, and hOAT-specific 
epitopes. Such fragments may, for example, be produced by proteolytic cleavage of the 
full-length protein. Preferably, the fragment is obtained recombinantly by appropriately 
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modifying the DNA sequence encoding the proteins to delete one or more amino acids at 
one or more sites of the C-terminus, N-terminus, and/or within the native sequence. 
Fragments of a protein can be used as antigens for the production of hOAT-specific 
antibodies, or used as competitors of substrates for hOAT polypeptides. It is understood 
5 that such fragments may retain one or more characterizing portions of the native complex. 
Examples of such retained characteristics include: transporting activity; substrate 
specificity; interaction with other molecules in the intact cell; regulatory functions; or 
binding with an antibody specific for the native complex, or an epitope thereof. 

Well-known examples of domains are the SH2 (Src Homology 2) domain 

10 (Sadowski, et al., (1986) Mol Cell BioL 6:4396; Pawson and Schlessinger, (1993) Curr. 
BioL 3:434), the SH3 domain (Mayer, et al., (1988) Nature 332:272; Pawson and 
Schlessinger, (1993) Curr. BioL 3:434), and pleckstrin (PH) domain (Ponting, (1996) TIBS 
21:245; Haslam, et al., (1993) Nature 363:309), all of which are domains that mediate 
protein :protein interaction, and the kinase catalytic domain (Hanks and Hunter, (1995) 

1 5 FASEB J 9 :576-595). The relative homology is at least 20%, more preferably at least 30% 
and most preferably at least 35%. Computer programs designed to detect such homologies 
are well known in the art. 

Comparisons between the sequences of two or more polynucleotides or 
polypeptides can be performed using the local homology algorithm of Smith and 

20 Waterman, (1981) Adv. Appl. Math. 2:482, by the homology alignment algorithm of 
Needleman and Wunsch, (1970) J. MoL BioL 48:443, or the method of Pearson and 
Lipman, (1988) PNAS 85:2444. Computer programs implementing these methods can be 
used and include, BLAST, GAP, BESTFTT, FASTA, and TFASTA which are offered in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr*, 

25 Madison, Wis. 

Protein motifs can be identified using computer programs readily available to those 
skilled in the art One such computer program is offered by the Institute for Chemical 
Research at Kyoto University. The Kyoto computer program is offered at the following 
web site (http://www.motifgenome.ad.ipA . 
30 Protein topology, such as the orientation and location of transmembrane helixes 

can also be identified using readily available computer programs. One such program is 
TopPred2 offered by Stockholm University and offered at the following web site 
(http://www.biokeini.su.seA-server/toppred2/). 

10 



JNSDOCID: <WO 01 04283A2 I > 



WO 01/04283 



PCT/US00/18980 



Another aspect of the invention features an isolated, enriched or purified hOAT 
polypeptide analog. 

The hOAT polypeptide analogs of the present invention preferably have a 
substantially similar biological activity to the proteins encoded by the full-length nucleic 
5 acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ID NO:5, or SEQ ID NO:6, or the amino acid sequence set forth in SEQ ID NO:7, or 
SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, or SEQ ID NO: 12. 

A sequence that is substantially similar will preferably have at least 85% identity 
(more preferably at least 90% and most preferably 95-100%) to the amino acid sequence 
1 0 encoded by the nucleic acid sequence set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or the amino acid sequence set 
forth in SEQ ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:ll, or 
SEQ ID NO: 12. 

An hOAT polypeptide analog will retain some useful function such as, for 

1 5 example, substrate binding, transport activity, or the ability to bind to an hOAT specific 
antibody (as defined below). The hOAT polypeptide analog may be derived from a 
naturally occurring complex component by appropriately modifying the protein DNA 
coding sequence to add, remove, and/or to modify codons for one or more amino acids at 
one or more sites of the C-terminus, N-terminus, and/or within the native sequence. It is 

20 understood that such analogs having added, substituted and/or additional amino acids 
retain one or more characterizing portions of the native hOAT polypeptides. 

Examples of hOAT polypeptides analogs include, but are not limited to, (1) 
chimeric proteins which comprise a portion of an hOAT polypeptide sequence fused to a 
non-hOAT polypeptide sequence, for example, a polypeptide sequence of 

25 glutathione-S-transferase, (2) hOAT polypeptides lacking a specific domain, for example, 
the protein kinase C domain, (3) epitope-tagged hOAT polypeptides or fragments for 
immuno detection or purification and (4) hOAT proteins having a point mutation. 

Human OAT polypeptide analogs with deleted, inserted and/or substituted amino 
acid residues may be prepared using standard techniques well known to those of ordinary 

30 skill in the art. For example, the modified components of the analogs may be produced 

using site-directed mutagenesis techniques (as exemplified by Adelman et aL, (1983) DNA 
2:183 ) wherein nucleotides in the DNA coding the sequence are modified such that a 
modified coding sequence is produced, and thereafter expressing this recombinant DNA in 
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a prokaryotic or eukaryotic host cell, using techniques such as those described above. 
Alternatively, proteins with amino acid deletions, insertions and/or substitutions may be 
conveniently prepared by direct chemical synthesis, using methods well known in the art. 
The analogs of the hOAT polypeptides may exhibit the same qualitative biological activity 
5 as the hOAT polypeptides themselves. 

In another aspect, the invention features an antibody (e.g., a monoclonal or 
polyclonal antibody) having specific-binding affinity to an hOAT polypeptide or hOAT 
polypeptide analog or fragment 

Antibodies having specific-binding affinity to an hOAT polypeptide may be used 

10 in methods for detecting the presence and/or amount of an hOAT polypeptide in a sample 
by contacting the sample with the antibody under conditions such that an immunocomplex 
forms and detecting the presence and/or amount of the antibody conjugated to the hOAT 
polypeptide. Diagnostic kits for performing such methods may be constructed to include a 
first container containing the antibody and a second container having a conjugate of a 

15 binding partner of the antibody and a label, such as, for example, a radioisotope. The 
diagnostic kit may also include notification of an FDA approved use and instructions 
therefor. 

For the production of polyclonal antibodies, various host animals may be 
immunized by injection with the antigen. Various adjuvants may be used to increase the 
20 immunological response, depending on the host species. 

Monoclonal antibodies may be obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. Monoclonal 
antibodies may be obtained by methods known to those skilled in the art* See, for 
example, Kohler, et aL, (1975) Nature 256:495-497, and U.S. Patent No. 4,376,1 10 issued 
25 March 8, 1983, to David et al. 

Still further, the invention features a hybridoma which produces an antibody 
having specific-binding affinity to an hOAT polypeptide. 

In preferred embodiments, the hOAT antibody comprises a sequence of amino 
acids that is able to specifically bind an hOAT polypeptide. 
30 The invention features a method for identifying human cells containing an hOAT 

polypeptide or a related sequence. The method involves identifying the novel polypeptide 
in human cells using techniques that are routine and standard in the art, such as those 
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described herein for identifying hOAT {e.g., cloning, Southern or Northern blot analysis, 
Western blot analysis, immunoassay, in situ hybridization, PCR amplification, etc). 

The invention also features methods of screening cells for natural-binding partners 
of hOAT polypeptides. Binding partners include modulators and downstream signaling 
5 molecules such as adaptor proteins and may be identified by techniques well known in the 
art such as co-immunoprecipitation or by using, for example, a two-hybrid screen. (Fields 
and Song, U.S. Patent No. 5,283,173, issued February 1, 1994, and incorporated by 
reference herein). The present invention also features the purified, isolated or enriched 
versions of the polypeptides identified by the methods described above. 
1 0 The invention provides methods for screening compounds for their ability to 

inhibit, or modulate the biological activity of the human organic anion transporter 
molecules of the invention. In preferred embodiments, cells expressing an hOAT 
polypeptide, including recombinant expression constructs of the invention, are contacted 
with a compound, and transport activity is assayed. In especially preferred embodiments, 
1 5 competition assays, using known human organic anion transporter substrates, such as p- 
aminohippurate, are used to test the ability of a screened compound to interfere (compete) 
with the uptake of a known substrate of an hOAT polypeptide. Additional preferred 
embodiments comprise quantitative analyses of such effects. 

The present invention is also useful for the detection of substrates, analogues of 
20 substrates, inhibitors and modulators, heretofore known or unknown, for the transporters 
of the invention, either naturally occurring or embodied as a drug. In preferred 
embodiments, such substrates, analogues, or modulators may be detected in blood, saliva, 
semen, cerebrospinal fluid, plasma, lymph, vitreous humor, or any other bodily fluid, hi 
additional preferred embodiments, the invention provides methods for detecting and 
25 identifying substrates, analogues of substrates, or modulators that preferentially affect 
either the uptake function or the efflux function of the transporters of the invention. 

One method for identifying a substance capable of modulating an hOAT 
polypeptide activity comprises the steps of (a) contacting an hOAT polypeptide with a test 
substance; and (b) determining whether the substance alters the transporting activity of 
30 said polypeptide. 

Another method of identifying substances capable of modulating the function of an 
hOAT polypeptide comprises the following steps: (a) expressing an hOAT polypeptide in 
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cells; (b) adding a compound to the cells; and (c) monitoring the transporting activity of 
the hOAT. 

A method of identifying substrates or inhibitors of hOAT polypeptides comprises 
the following steps: (a) adding a test substrate to a cell line expressing an hOAT 
5 polypeptide; and (b) measuring the intracellular concentration of the test substrate. 

A second method of identifying substrates or inhibitors of hOAT polypeptides 
comprises the following steps: (a) adding a test substrate to a cell line expressing an hOAT 
polypeptide; (b) adding a substrate known to be transported by said hOAT polypeptides; 
(c) measuring whether said test substrate competes with the uptake of the known substrate. 
10 Another method of identifying substrates of an hOAT polypeptide comprises the 

following steps: (a) attaching a fluorescent compound onto a test compound; (b) adding 
said compound to a cell line expressing an hOAT polypeptide; and (c) monitoring whether 
said fluorescent compound is taken into said cell line through the use of fluorescent 
microscopy or any other method which can detect fluorescence. 
15 In a preferred embodiment, high- throughput screening employing 96-well plates 

and a microtiter® fluorescence detection system is used to detect potential modulators, 
substrates, analogues of substrates and inhibitors of hOAT polypeptides which can 
compete with binding or transport of fluorescent compounds known to bind or be 
transported by hOAT polypeptides. 
20 In yet another embodiment, uptake of a test substrate by an hOAT expressing cell 

line is measured using knowledge of an hOAT's antiporter activity. Uptake is measured 
by monitoring efflux of an accompanying antiporter molecule. Efflux of the antiporter 
molecule can be detected using techniques known to those skilled in the art, for example, 
radiolabeling of the effluxed molecule and measurement using a scintillation counter. 
25 Measurement of antiporter efflux is also amenable to high-throughput screening. 

A still further aspect of the invention is the identification of substrates, analogues 
of substrates, inhibitors, or modulators that affect the pharmacokinetics of drugs and 
compounds transported by hOAT polypeptides. Substances which decrease the 
transporting activity of hOAT polypeptides are useful for increasing the half-life in the 
30 body of drugs and compounds excreted via human organic anion transporters. 

Yet, another aspect of the invention involves substrates, and modulators, which 
increase the transporting activity of an hOAT. Such substances are advantageous for 
increasing the delivery of a drug or compound, which is transported by an hOAT, to a 
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target organ such as kidney or liver. For example, a modulator which increases the 
transporting activity of hOATl would increase the delivery of PAH or other drugs or 
compounds transported by hOATl to the kidney. 

A last aspect of the invention features a cultured cell line which stably expresses an 
5 hOAT polypeptide. Such cell lines are useful for identifying substrates which are 

transported by hOAT polypeptides, inhibitors of hOAT polypeptides and for identifying 
modulators of hOAT polypeptides. Further, a panel of cells stably expressing hOAT 
polypeptides will be useful in studying the pharmacokinetics of various drugs and 
compounds. Preferably, the expression of an hOAT polypeptide in such cell lines is under 

10 the control of an inducible promoter. More preferably, said inducible promoter has a very 
low basal level of expression in such cell lines and high expression when induced. 

A cell line stably expressing an hOAT polypeptide is useful for designing 
substrates which can be transported by an hOAT polypeptide at either an increased or a 
decreased rate. Decreasing the rate of transport can increase the effectiveness of some 

1 5 medicinals by making their clearance from the body occur more slowly. 

Cell lines stably expressing hOAT polypeptides are also useful for the 
identification and development of drugs or compounds targeted to a particular organ. For 
example, drugs or compounds identified as substrates of hOATl would be good 
candidates for treating diseases of the kidneys whereas those identified as substrates of 

20 hOAT2 would be good candidates for treating diseases of the liver. 

The OAT expressing cell lines are also useful for identifying potentially toxic 
compounds or drugs. For example, newly discovered or modified drugs belonging to a 
class of drugs known to be toxic to certain organs, e.g., kidney can be tested for transport 
against the panel of cells expressing human OAT1 or hOAT3 polypeptides. If the new 

25 drug or compound is transported by hOATl expressing cells, the new drug or compound 
may also be toxic to the kidney. However, if the new drug or compound, belonging to a 
class known to be toxic, is not transported by hOATl or hOAT3 expressing cells than the 
new drug or compound would be expected to be less toxic to the kidney than the parent 
class of drugs or compounds. 

30 A preferred embodiment includes the identification of toxic compounds which are 

substrates of hOATs, and which can be used to treat diseases of organs or tissues that 
express hOATs. For example, toxic compounds found to be transported by hOATl or 
hOAT3 would have potential use as anti-cancer drugs to treat cancers of the kidney. 
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The methods of the present invention can utilize any of the molecules disclosed in 
the invention. These molecules include nucleic acid molecules encoding hOAT 
polypeptides, nucleic acid vectors, recombinant cells, polypeptides, or antibodies 
described herein. 

The summary of the invention described above is non-limiting and other features 
and advantages of the invention will be apparent from the following detailed description, 
and from the claims. 



Definitions 

1 0 By "isolated" in reference to nucleic acid it is meant a polymer preferably 

consisting of 14 or more nucleotides conjugated to each other, including DNA or RNA 
that is isolated from a natural source or that is synthesized. The isolated nucleic acid of 
the present invention is unique in the sense that it is not found in a pure or separated state 
in nature. Use of the term "isolated" indicates that a naturally occurring sequence has 

15 been removed from its normal cellular (i.e., chromosomal) environment. Thus, the 

sequence may be in a cell-free solution or placed in a different cellular environment. The 
term does not imply that the sequence is the only nucleotide sequence present, but that it is 
essentially free (about 90-95% pure at least) of nucleotide material naturally associated 
with it and thus is meant to be distinguished from isolated chromosomes. 

20 By the use of the term "enriched" in reference to nucleic acid it is meant that the 

specific DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of the 
total DNA or RNA present in the cells or solution of interest than in normal or diseased 
cells or in the cells from which the sequence was taken. This could be caused by a 
preferential reduction in the amount of other DNA or RNA present, or by a preferential 

25 increase in the amount of the specific DNA or RNA sequence, or by a combination of the 
two. However, it should be noted that "enriched" does not imply that there are no other 
DNA or RNA sequences present, just that the relative amount of the sequence of interest 
has been significantly increased. 

The term "significant" here is used to indicate that the level of increase is useful to 

30 the person making such an increase, and generally means an increase relative to other 
nucleic acids of about at least 2 fold, more preferably at least 5 to 1 0 fold or even more. 
The term also does not imply that there is no DNA or RNA from other sources. The other 
source DNA may, for example, comprise DNA from a yeast or bacterial genome, or a 
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cloning vector such as pUC19. This term distinguishes the sequence from naturally 
occurring enrichment events, such as viral infection, or tumor type growths, in which the 
level of one mRNA may be naturally increased relative to other species of mRNA. That 
is, the term is meant to cover only those situations in which a person has intervened to 
5 elevate the proportion of the desired nucleic acid* 

The term "purified" in reference to nucleic acid does not require absolute purity 
(such as a homogeneous preparation); instead, it represents an indication that the sequence 
is relatively purer than in the natural environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of mg/mL). The term is also chosen to 

10 distinguish clones already in existence which may encode an hOAT polypeptide but which 
have not been isolated from other clones in a library of clones. Thus, the term covers 
clones encoding an hOAT polypeptide which are isolated from other non-hOAT clones. 

The term "nucleic acid molecule" describes a polymer of deoxyribonucleotides 
(DNA) or ribonucleotides (RNA). The nucleic acid molecule may be isolated from a 

1 5 natural source by cDNA cloning or subtractive hybridization or synthesized manually. 
The nucleic acid molecule may be synthesized manually by the triester synthetic method 
or by using an automated DNA synthesizer. 

The term "cDNA cloning" refers to hybridizing a small nucleic acid molecule, a 
probe, to cDNA. The probe hybridizes (binds) to complementary sequences of cDNA. 

20 The term "complementary" describes two nucleotides that can form multiple 

favorable interactions with one another. For example, adenine is complementary to 
thymine as they can form two hydrogen bonds. Similarly, guanine and cytosine are 
complementary since they can form three hydrogen bonds. Thus if a nucleic acid 
sequence contains the following sequence of bases, thymine, adenine, guanine and 

25 cytosine, a "complement" of this nucleic acid molecule would be a molecule containing 
adenine in the place of thymine, thymine in the place of adenine, cytosine in the place of 
guanine, and guanine in the place of cytosine. Because the complement can contain a 
nucleic acid sequence that forms optimal interactions with the parent nucleic acid 
molecule, such a complement can bind with high affinity to its parent molecule. 

30 The term **hybridize" refers to a method of interacting a nucleic acid sequence with 

a DNA or RNA molecule in solution or on a solid support, such as cellulose or 
nitrocellulose. If a nucleic acid sequence binds to the DNA or RNA molecule with higji 
affinity, it is said to "hybridize" to the DNA or RNA molecule. The strength of the 
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interaction between the probing sequence and its target can be assessed by varying the 
stringency of the hybridization conditions. Under highly stringent hybridization 
conditions, only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having one or two mismatches out of 20 
5 contiguous nucleotides. 

By "conserved nucleic acid regions", it is meant regions present on two or more 
nucleic acids encoding an hOAT polypeptide, to which a particular nucleic acid sequence 
can hybridize under lower stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acids encoding hOAT polypeptides are 
10 provided in Abe, et al. (1992) J. Biol Chem. 19:13361 (hereby incorporated by reference 
herein in its entirety, including any drawings). Preferably, conserved regions differ by no 
more than 5 out of 20 contiguous nucleotides. 

By "unique nucleic acid region" is meant a sequence present in a full-length 
nucleic acid coding for an hOAT polypeptide that is not present in a sequence coding for 
15 any other known naturally occurring polypeptide. Such regions preferably comprise 12 or 
more contiguous nucleotides present in the fall-length nucleic acid encoding an hOAT 
polypeptide. In particular, a unique nucleic acid region is preferably of human origin. 

The tenn "nucleic acid vector" relates to a single or double stranded circular 
nucleic acid molecule that can be transfected or transformed into cells and replicate 
20 independently or within the host cell genome. 

The terms "transformation" and "transfection" refer to methods of inserting an 
expression construct into a cellular organism. 

The term **promoter element" describes a nucleotide sequence that is incorporated 
into a vector that, once inside an appropriate cell, can facilitate transcription factor and/or 
25 polymerase binding and subsequent transcription of portions of the vector DNA into 

mRNA. The promoter element precedes the 5* end of the hOAT polypeptide nucleic acid 
molecule such that the latter is transcribed into mRNA. Host cell machinery then 
translates mRNA into a polypeptide. 

By an "hOAT polypeptide" is meant the full-length amino acid sequence of SEQ 
30 IDNO:7, SEQIDNO:8, SEQIDNO:9, SEQIDNO.IO, SEQIDNO.ll, SEQIDNO:12, 
or a functional derivative thereof. 

The term "recombinant cell" refers to a cell that has been altered to contain a new 
combination of genes or nucleic acid molecules. 
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The term Recombinant organism" refers to an organism that has a new 
combination of genes or nucleic acid molecules. 

The term "organism" relates to any living being comprised of at least one cell. An 
organism can be as simple as one eukaryotic cell or as complex as a mammal. Therefore, 
5 a recombinant organism can also be a recombinant cell, which may be a eukaryotic or a 
prokaryotic organism. 

By Recombinant hOAT polypeptide" it is meant to include a polypeptide produced 
by recombinant DNA techniques such that it is distinct from a naturally occurring 
polypeptide either in its location (e.g., present in a different cell or tissue than found in 
10 nature), purity or structure. Generally, such a recombinant polypeptide will be present in a 
cell in an amount different from that normally observed in nature. 

The term "eukaryote" refers to an organism comprised of cells that contain a 
nucleus. Eukaryotes are differentiated from "prokaryotes" which do not have a nucleus 
and lack other cellular structures found in eukaryotes, such as mitochondria and 
15 endoplasmic reticulum. Prokaryotes include unicellular organisms, such as bacteria, while 
eukaryotes are represented by yeast, invertebrates, and vertebrates. 

The term "extragenomic" refers to a nucleic acid vector which does not insert into 
the cell genome. Thus, these vectors replicate independently of the host genome and do 
not recombine with or integrate into the genome. 
20 The term "intragenomic" defines a nucleic acid construct that is incorporated 

within the cell genome. 

By isolated" in reference to a polypeptide is meant a polymer of 6, 12, 18 or more 
amino acids conjugated to each other, including polypeptides that are isolated from a 
natural source or that are synthesized. The isolated polypeptides of the present invention 
25 are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 
the only amino acid chain present, but that it is essentially free (about 90-95% pure at 
30 least) of material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide it is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2-5 fold) of the 
total of amino acid sequences present in the cells or solution of interest than in normal or 
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diseased cells or in the cells from which the sequence was taken. This could be caused by 
a preferential reduction in the amount of other amino acid sequences present, or by a 
preferential increase in the amount of the specific amino acid sequence of interest, or by a 
combination of the two. However, it should be noted that "enriched" does not imply that 
5 there are no other amino acid sequences present, just that the relative amount of the 
sequence of interest has been significantly increased. 

The term "significant** here is used to indicate that the level of increase is useful to 
the person making such an increase, and generally means an increase relative to other 
amino acid sequences of about at least 2 fold, more preferably at least 5 to 10 fold or even 

10 more. The term also does not imply that there are no amino acid sequences from other 
sources. The other source amino acid sequences may, for example, comprise amino acid 
sequences encoded by a yeast or bacterial genome, or a cloning vector such as pUC19. 
The term is meant to cover only those situations in which a person has intervened to 
elevate the proportion of the desired amino acid sequences. 

15 The term "purified" in reference to a polypeptide does not require absolute purity 

(such as a homogeneous preparation); instead, it represents an indication that the sequence 
is relatively purer than in the natural environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of mg/mL). 

By "an hOAT polypeptide fragment" it is meant an amino acid sequence that is 

20 less than the full-length hOAT amino acid sequences (and which excludes the listed EST 
sequences) encoded by the nucleic acid sequence set forth in SEQ ID NO: 1 (excluding 
EST clone R25797), SEQ ID NO:2 (excluding EST clone A1016020), SEQ ID NO:3 
(excluding EST clone A 1016020), SEQ ID NO:4 (excluding EST clone H41333), SEQ ID 
NO:5, or SEQ ID NO: 6 (excluding EST clone AA705512) or the amino acid sequence 

25 (excluding the amino acids encoded by the EST clones previously listed set forth in SEQ 
ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, or SEQ ID 
NO: 12, Thus, the term "fragment" is used to indicate a polypeptide derived from the 
amino acid sequence of the hOAT polypeptides, of the complexes having a length less 
than the full-length polypeptide from which it has been derived. 

30 By "an hOAT domain** it is meant a portion of the hOAT polypeptide having 

homology to amino acid sequences from one or more known proteins wherein the 
sequence predicts some common function, interaction or activity. 
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By "hOAT polypeptide analog" it is meant an amino acid sequence substantially 
similar to the sequence encoded by the nucleic acid sequence set forth in SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ED NO:5. or SEQ ID NO:6, or the 
amino acid sequence set forth in SEQ ID NO:7, or SEQ ID NO:8; SEQ ID NO:9, SEQ ID 
5 NO:10, SEQ ID NO:l 1, or SEQ ID NO:12. 

By "biological activity** it is meant an activity of the hOAT polypeptides in a cell. 
The biological activity of the hOAT polypeptides is related to some of the activities of the 
cell which include, but are not limited to, transport (uptake or excretion of a substrate from 
a cell), cell proliferation, mitogenesis, metastasis, tumor escape, cell adhesion, 
1 0 transformation, or apoptosis. 

By "identity** is meant a property of sequences that measures their similarity or 
relationship. Identity is measured by dividing the number of identical residues in the two 
sequences by the total number of residues and multiplying the product by 100. Thus, two 
copies of exactly the same sequence have 100% identity, but sequences that are less highly 
15 conserved and have deletions, additions, or replacements have a lower degree of identity. 
Those skilled in the art will recognize that several computer programs are available for 
determining sequence identity. 

An hOAT polypeptide analog may differ from the native sequence of an hOAT 
polypeptide in that one or more amino acids have been changed, added or deleted. 
20 Changes in amino acids may be conservative or non-conservative. By "conservative" it is 
meant the substitution of an amino acid for one with similar properties such as charge, 
hydrophobicity, structure, etc. 

By "hOAT-specific epitope** it is meant a sequence of amino acids that is both 
antigenic and unique to hOAT. 
25 By "specific-binding affinity" is meant that the antibody binds to target (hOAT) 

polypeptides with greater affinity than it binds to other polypeptides under specified 
conditions. 

The term '^polyclonal" refers to antibodies that are heterogenous populations of 
antibody molecules derived from the sera of animals immunized with an antigen or an 
30 antigenic functional derivative thereof. 

"Monoclonal antibodies** are substantially homogenous populations of antibodies 

to a particular antigen. 
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The term "antibody fragment" refers to a portion of an antibody, often the 
hypervariable region and portions of the surrounding heavy and light chains, that displays 
specific binding affinity for a particular molecule. A hypervariable region is a portion of 
an antibody that physically binds to the polypeptide target. 
5 By "hybridoma" is meant an immortalized cell line which is capable of secreting 

an antibody, for example an hOAT antibody. 

By **natural-binding partner" it is meant a protein, organic anion, or other molecule 
that interacts with an hOAT polypeptide. 

As used herein, pharmacokinetics is the process by which a drug or compound is 
10 absorbed, distributed, metabolized and eliminated by the body. 

The term "compound" includes small organic or inorganic molecules of molecular 
weight of preferably less than 1000 atomic units, more preferably less than 800 atomic 
units, and most preferably less than 500 atomic units. "Organic molecules" include all 
molecules that contain a carbon atom, whereas "inorganic molecules" are those that do not 
15 have a carbon atom. 

The term "function" refers to the cellular role of a protein. The role of the proteins 
of the invention may include transport of substrates into and out of a cell, involvement in 
cascades controlling cell growth, migration, differentiation, gene expression, muscle 
contraction, glucose metabolism, cellular protein synthesis, and regulation of the cell 
20 cycle. 

The term "modulates" refers to the ability of a compound to alter the function of an 
hOAT polypeptide. A modulator preferably activates or decreases the transporter activity 
of an hOAT protein. A modulator that increases the transporting activity is a positive 
modulator; and one that decreases the transporting activity is a negative modulator. The 

25 term "modulates" also refers to altering the function of a protein by increasing or 

decreasing the probability that a complex, i.e. an assembly of at least two molecules bound 
to one another, forms between an hOAT protein and a natural-binding partner. 

The term "transporting activity," in the context of the invention, defines the ability 
of a transporter polypeptide to uptake a substrate into a cell or efflux a molecule out of a 

30 cell. 

The term "substrate" as used herein refers to a molecule that is transported into or 
out of a cell by an OAT polypeptide. The substrate may be an organic compound or 
molecule, inorganic compound or molecule, a peptide, or a protein. 
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The term "activates" refers to increasing the transport or efflux of a molecule into 
or out of a cell. 

The term "inhibitor** refers to a compound or substance that binds to a substrate- 
binding site and, thereby, decreases or prevents transport of an hOAT substrate. 
5 The term "expressing" as used herein refers to the production of an hOAT 

polypeptide from a nucleic acid vector containing an hOAT gene within a cell. 

The term "adding" as used herein refers to administering a solution comprising a 
compound to the medium bathing cells. 

The term "functional derivative** with respect to a polypeptide is a polypeptide that 
10 possesses a biological activity (either functional or structural) or an immunological 
characteristic that is substantially similar to a biological activity or an immunological 
characteristic of a non-recombinant hOAT. A functional derivative of an hOAT 
polypeptide may or may not contain post-translational modifications such as covalently 
linked carbohydrate, depending on the necessity of such modifications for the performance 
15 of a specific function. The term "functional derivative" is intended to include the 
"fragments**, "variants", "analogues**, "homologies" or chemical derivatives of a 
molecule. 

Similarly, a "functional derivative** of a gene encoding an hOAT polypeptide of 
the present invention includes "fragments", variants**, or "analogues'* of the gene, which 
20 may be "substantially similar*' in nucleotide sequence, and which encode a molecule 
possessing similar activity to an hOAT polypeptide or fragment thereof. Permutations 
resulting from degeneracy of the genetic code are also considered functional derivatives. 

A molecule is said to be "substantially similar** to another molecule if the sequence 
of amino acids in both molecules is substantially the same. Substantially similar amino 
25 acid molecules will possess a similar biological activity. Thus, provided that two 

molecules possess a similar activity, they are considered variants as that term is used 
herein even if one of the molecules contains additional amino acid residues not found in 
the other, or if the sequence of amino acid residues is not identical. 

A "chemical derivative** of a polypeptide contains additional chemical moieties not 
30 normally a part of the polypeptide. 

The term "mammalian" refers to such organisms as mice, rats, rabbits, goats, more 
preferably monkeys and apes, and most preferably humans. 
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Brief Description of the Figures 

FIG. 1 shows the cDNA and deduced amino acid sequences of human OAT1. 

FIG. 2 shows the cDNA and deduced amino acid sequences of human OAT2A. 

FIG. 3 shows the cDNA and deduced amino acid sequences of human OAT2B. 
5 FIG. 4 shows the cDNA and deduced amino acid sequences of human OAT3. 

FIG. 5 shows the cDNA and deduced amino acid sequences of human OAT4. 

FIG. 6 shows the cDNA and deduced amino acid sequences of human OAT5. 

FIG. 7 shows the amount of 3 H-PAH taken up by HeLa cells transfected with 
increasing amounts of hOATl or vector control (LacZ) cDNAs. 
10 FIG. 8 shows that oocytes injected with hOATl RNA uptake 3 H-PAH to a much 

greater extent than water injected oocytes. 

FIG. 9 shows that ECR293 cells stably containing hOATl, in plasmid pIND(spl), 
uptake approximately 50-fold more 3 H-PAH when induced by ponasterone than non- 
induced cells. 

15 FIG. 10 shows that ECR293 cells stably containing hOAT2A, in plasmid 

pIND(spl), uptake approximately 2.6-fold more 3 H-PAH when induced by ponasterone 
than non-induced cells. 

FIG. 1 1 shows that sulfobromophthalein but not a-kefoglutarate competed with 
3 H-PAH uptake in ECR293/OAT2A cells induced with ponasterone. 
20 FIG. 12 shows that oocytes injected with hOAT3 RNA uptake 3 H-PAH to a much 

greater extent than water injected oocytes. 

FIG. 13 (A-E) shows Northern Blot analyses of hOATl -5 mRNA in various 
human tissues. 

25 

' r 

DETAILED DESCRIPTION OF THE INVENTION 



30 A. Introduction 

Organic anions are carbon-based molecules with one or more negative charges at 
physiological pH. They can be endogenous compounds such as bile acids or xenobiotics 
such as />-aminohippurate. Organic anion transporters are membrane proteins that 
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facilitate the transport of organic anions across the cell membrane. The family described 
herein consists of proteins sharing at least 50% amino acid sequence similarity among 
each other. They show broad substrate specificity, meaning that each member can 
transport a variety of organic anions. Hence, they are often referred to as multi-specific 
5 organic anion transporters. The mechanism of transport, though not currently elucidated, 
probably is a secondary or tertiary active transport involving exchange of another organic 
anion. An example of a tertiary transport system is the rat kidney OAT1 transporter. The 
primary transport system is the Na + /K. + -ATPase, which establishes the high out-to-in Na + 
gradient. This Na + gradient then drives the co transport of Na + and dicarboxylates such as 
10 a-ketoglutarate through a secondary Na + /dicarboxylate transporter. Finally, the tertiary 
transport system is established by OAT1, which exchanges external PAH for intracellular 
a-ketoglutarate. 

The liver and kidney are two organs rich in organic anion transporters. 
Hepatocytes and epithelial cells of the kidney proximal tubules are polarized cells with a 

15 basolateral and apical membrane. For the hepatocyte, the basolateral membrane faces the 
blood-filled sinusoid. Thus, the basolateral membrane is also called the sinusoidal 
membrane. The apical membrane faces the canaliculus, into which bile is secreted. The 
hepatocyte thus can transport organic anions from the blood into the bile canaliculus 
across the basolateral and apical membranes. Similarly, cells of the kidney proximal 

20 tubule transport organic anions from the blood into urine across their basolateral and 
apical membranes. Examples of liver sinusoidal transporters are the oatpl and oatp2, 
whereas OAT1 is a kidney basolateral transporter. 



B. General Method 

25 The practice of the present invention employs conventional techniques of 

molecular and cell biology, microbiology, and recombinant DNA. Most techniques are 
well-described in J. Sambrook et al., "Molecular Cloning; A Laboratory Manual" (1989) 
and J. Celis, ed. "Cell Biology: A Laboratory Handbook" (1998). 



30 Identification of hOAT Nucleic Acid Molecules 

Extensive searches of Genbank databases were performed using database tools 
available through the website maintained by the National Center for Biotechnology 
Information (http://www.ncbi.nlm.nih.gov/). In particular, the Basic Local Alignment 
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Search Tool (BLAST) was used to perform sequence similarity searches of the Genbank 
databases, such as the database containing human expressed sequence tags (ESTs). The 
EST database contains sequences of randomly selected clones from a variety of human 
cDNA libraries. Since each EST clone was sequenced with one primer, about 300-500 bp 
5 of sequence information is available from each clone. The BLAST programs can translate 
these sequences in all possible reading frames and compare them to an input protein 
sequence. Human ESTs showing homology to rat OAT proteins where hereby identified. 

Identified EST clones can be obtained from licensed distributors of the I.M.A.G.E. 
Consortium Lawrence Livermore National Laboratory (LLNL) cDNA clones. The clones 
10 are then sequenced to verify their homologies to published sequences. DNA sequencing is 
usually done by Sanger's method of using dideoxynucleotide triphosphates in a primer 
extension reaction. The DNA to be sequenced, called the template, is first denatured and 

35 

then annealed to a specific primer, usually between 15-25 basepairs. In one method, S- 
adenosine triphosphate is incorporated into the growing DNA strand by a DNA 
15 polymerase. Materials for such a reaction are available from commercial kits such as 
Sequenase 2.0® from Amersham. Reaction products are separated on a long 
polyacrylamide gel, usually 6.0%, by electrophoresis using Tris-borate buffer. After 
electrophoresis, the gel is dried under vacuum and then exposed to autoradiographic film. 
The DNA sequence can be read from the autorad image and entered into computer for 
20 sequence analysis. Alternatively, fluorescent-tagged nucleotides can be incorporated by 
the polymerase chain reaction (PCR) into the template DNA using the thermal-stable 
polymerase from Thermits aquaticus (Taq polymerase). The PCR products can then be 
loaded onto an automated sequencer such as the ABI 377 (PE Applied Biosystems, 
Foster City, CA), which separates the products and determines the sequence. 
5 Isolation of cDNA clones from a particular tissue requires the construction of a 

cDNA library representing messages of that tissue. Library construction entails the 
isolation of mRNA from the source tissue and usually a fractionation step to separate low 
(< 0.5 kb) and high molecular weight species. The mRNAs are then primed with a DNA 
oligonucleotide, either a poly-thymidine or a mixture of random primers. Reverse 
0 transcriptase is then added to synthesize the complementary DNA strand. Short DNA 
fragments with sticky ends, called adapters, are ligated to the cDNAs so they can be 
subcloned into an appropriate vector. Popular vectors include bacteriophages such as 
Xgtl 1 or AZAP (Stratagene, La Jolla, CA). For phage libraries, the cDNAs are ligated to 
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lambda phage arms at the adapter site and then packaged into phage particles. Once 
packaged, the library can then be amplified by infecting an appropriate bacterial host. 
Ready-made libraries of various human tissues are available from several commercial 
sources. 

5 To isolate a particular gene of interest from the cDNA library, an aliquot is plated 

out onto large plates and then transferred to nitrocellulose or nylon membrane. The phage 
DNA attached to the membrane is denatured in alkali solution, neutralized, and then 
crosslinked to the filter by baking or UV irradiation. The filter is then hybridized with a 
probe of interest. The probe can be an oligonucleotide or a longer, double- stranded 

10 fragment of DNA, such as the insert of an EST clone. The double-stranded probe is 

usually labeled with 32 P to high specific activity (>10 9 dpm/^ig), denatured by boiling for 2 
minutes, and added to a hybridization buffer to concentration of 1 0 6 dpm/ml. 
Hybridization buffers can vary in composition depending on the hybridization condition 
desired. One common formulation contains 0.9 M NaCl, 90 mM sodium citrate (pH 7.0), 

15 10 mM EDTA (pH 8.0), 0.1% (w/v) Ficoll, 0.1% (w/v) polyvinylpyrrolidone, 0.1 (w/v) 
BSA, 0.5 % SDS, and 100 jig/ml sheared, denatured salmon sperm DNA. Other 
formulations are available from commercial sources. The filters are hybridized with the 
probe at 65°C overnight or at least 16 hours. They are then washed with sodium citrate 
solutions with decreasing ionic strength and increasing temperature. Finally, they are 

20 exposed to autoradiographic film with an intensifying screen at -80°C overnight 

If a plaque contains DNA sequences complementary to the probe, the probe will 
hybridize to the complementary sequences on the filter. The plaque containing the 
complementary sequences is detected by exposing X-ray film to the filter. The plaque of 
interest can then be isolated and purified by conventional methods. Furthermore, once 

25 purified, the phage DNA can be manipulated so the relevant cDNA insert is excised from 
the phage into a phagmid by an in vivo excision procedure using appropriate host cells. 
The phagmids can then be amplified in bacteria and isolated by conventional methods 
such as alkaline lysis. Once the plasmid is purified, it can be analyzed by standard means 
such as restriction endonuclease or PCR mapping and DNA sequencing. DNA sequence 

30 information is analyzed for open reading frames and compared to published sequences. 
As discussed earlier, sets of computer programs for DNA and protein sequence analysis 
are commercially available. 
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Once a cDNA coding for a full-length transporter protein is isolated, it can be 
excised from its cloning vector and subcloned into an appropriate expression vector. 
Subcloning is a frequently used procedure in recombinant DNA technology. It involves 
the digestion of DNA by restriction endonucleases, fractionation of the DNA fragments by 
5 agarose gel electrophoresis, and purification of the desired fragment by a DNA-binding 
matrix. The purified fragment is then ligated to a precut vector of choice by adding an 
enzyme such as T4 DNA Iigase. The new DNA construct can then be transfected into a 
mammalian cell line to test for functional expression. Many types of expression plasmids 
are commercially available and they all share common elements. An expression plasmid 

10 usually contains a bacterial origin of replication, a DNA promoter that is active in a 

mammalian cell line, a multiple cloning site, a transcriptional termination element, and a 
selective marker that confers resistance to an antibiotic. An expression plasmid harboring 
the cDNA of interest can be transfected into mammalian cells by several conventional 
means like calcium phosphate precipitation or lipofection. Many transfection kits are 

15 commercially available. 

Recombinant Cells Containing a Nucleic Acid Molecule Encoding an hOAT 

Cell lines which stably express hOAT polypeptides can be isolated. Such cells 
may be obtained by using expression vectors which carry nucleic acid sequences which 
encode hOAT polypeptides. The expression vectors may contain viral origins of 
replication and/or endogenous expression elements. Further, such expression vectors may 
carry nucleic acid sequences which encode for a selectable marker. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
polypeptide if it contains nucleotide sequences which contain transcriptional and 
translational regulatory information and such sequences are "operably linked" to 
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 
regulatory regions needed for gene sequence expression may vary from organism to 
organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such 
regions will normally include those 5*-non-coding sequences involved with initiation of 
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transcription and translation, such as the TATA box, capping sequence, CAAT sequence, 
and the like. 

If desired, the non-coding region 3' to the sequence encoding an hOAT 
polypeptide of the invention may be retained for its transcriptional termination regulatory 
5 sequences, such as termination and polyadenylation. Thus, by retaining the 3'-region 
naturally contiguous to the DNA sequence encoding an hOAT polypeptide of the 
invention, the transcriptional termination signals may be provided. Where the 
transcriptional termination signals are not satisfactorily functional in the expression host 
cell, then a 3* region functional in the host cell may be substituted. 

10 Two DNA sequences (such as a promoter region sequence and a sequence 

encoding an hOAT polypeptide of the invention) are said to be operably linked if the 
nature of the linkage between the two DNA sequences does not (1) result in the 
introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region 
sequence to direct the transcription of a gene sequence encoding an hOAT polypeptide of 

15 the invention, or (3) interfere with the ability of the gene sequence of an hOAT 

polypeptide of the invention to be transcribed by the promoter region sequence. Thus, a 
promoter region would be operably linked to a DNA sequence if the promoter were 
capable of effecting transcription of that DNA sequence. Thus, to express a gene 
encoding an hOAT polypeptide of the invention, transcriptional and translational signals 

20 recognized by an appropriate host are necessary. 

The present invention encompasses the expression of a gene encoding an hOAT 
polypeptide of the invention (or a functional derivative thereof) in either prokaryotic or 
eukaryotic cells. Prokaryotic hosts are, generally, very efficient and convenient for the 
production of recombinant proteins and for the propagation of clones. Prokaryotes most 

25 frequently are represented by various strains of E, coli. However, other microbial strains 
may also be used, including other bacterial strains. 

In prokaryotic systems, plasmid vectors that contain replication sites and control 
sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCl 1 8, pUCl 19 and the like; suitable 

30 phage or bacteriophage vectors may include Xgtl 0, A,gtl 1 and the like; and suitable virus 
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the 
present invention has the capacity to replicate in the selected host cell. 
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Recognized prokaryotic hosts include bacteria such as E. coli 9 Bacillus, 
Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 
5 To express an hOAT polypeptide of the invention (or a functional derivative 

thereof) in a prokaryotic cell, it is necessary to operably link the sequence encoding the 
hOAT polypeptide of the invention to a functional prokaryotic promoter. Such promoters 
may be either constitutive or, more preferably, regulatable (i.e. 9 inducible or 
derepressible). Examples of constitutive promoters include the int promoter of 

10 bacteriophage X, the bla promoter of the JJ-lactamase gene sequence of pBR322, and the 
cat promoter of the chloramphenicol acetyl transferase gene sequence of pPR325, and the 
like. Examples of inducible prokaryotic promoters include the major right and left 
promoters of bacteriophage X (Pl and Pr), the trp, recA, XacZ, AacI, and gal promoters of 
E. coli, the a-amylase (Ulmanen et al. 9 (1985) J. BacterioL 162:176-182), and the 

15 promoters of the bacteriophages of Bacillus (Gryczan, In: The Molecular Biology of the 
Bacilli, Academic Press, Inc., NY, 1982), and Streptomyces promoters (Ward et a/., 
(1986) Mol Gen. Genet. 203:468-478). Prokaryotic promoters are reviewed by Glick 
(1987, Ind. MicrobioU 1:277-282), Cenatiempo (1986, Biochimie 68:505-516), and 
Gottesman (1 984, Ann. Rev. Genet. 1 8:415-442). 

20 Proper expression in a prokaryotic cell also requires the presence of a ribosome- 

binding site upstream of the gene sequence-encoding sequence* Such ribosome-binding 
sites are disclosed, for example, by Gold et aL (1981, Ann. Rev. Microbiol. 35:365-404). 
The selection of control sequences, expression vectors, transformation methods, and the 
like, are dependent on the type of host cell used to express the gene. As used herein, 

25 "ceir, "cell line", and "cell culture" may be used interchangeably and all such 

designations include progeny. Thus, the words * < transfonnants" or "transformed cells" 
include the primary subject cell and cultures derived therefrom, without regard to the 
number of transfers. It is also understood that all progeny may not be precisely identical 
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant 

30 progeny have the same functionality as that of the originally transformed cell. 

Host cells which may be used in the expression systems of the present invention 
are not strictly limited, provided that they are suitable for use in the expression of the 
hOAT polypeptide of interest. Especially suitable hosts for expressing the polypeptides of 
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the invention are eukaryotic cells* Preferred eukaryotic hosts include, for example, yeast, 
fungi, insect cells, amphibian oocytes, mammalian cells either in vivo, or in tissue culture. 
Mammalian cells which may be useful as hosts include 3T3-L1 preadipocytes, cells of 
fibroblast origin such as VERO or CHO-K1, or cells of lymphoid origin and their 
5 derivatives. Preferred mammalian host cells include HeLa, EcR293, SP2/0 and J558L, as 
well as neuroblastoma cell lines such as IMR 332, which may provide better capacities for 
correct post-translational processing. 

In addition, plant cells are also available as hosts, and control sequences 
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 

10 19S, and nopaline synthase promoter and polyadenylation signal sequences. Another 
preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as 
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, (1988) Science 
240:1453-1459). Alternatively, baculovirus vectors can be engineered to express large 
amounts of hOAT polypeptides of the invention in insect cells (Jasny, (1987) Science 

15 238:1653; Miller et aL 9 In: Genetic Engineering, Vol. 8, Plenum; Setlow et aL 9 eds., pp. 
277-297, 1986). 

Any of a series of yeast expression systems can be utilized which incorporate 
promoter and termination elements from the actively expressed sequences coding for 
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums 

20 rich in glucose. Known glycolytic gene sequences can also provide very efficient 

transcriptional control signals. Yeast provides substantial advantages in that it can also 
carry out post-translational modifications. A number of recombinant DNA strategies exist 
utilizing strong promoter sequences and high copy number plasmids which can be utilized 
for production of the desired proteins in yeast. Yeast recognizes leader sequences on 

25 cloned mammalian genes and secretes peptides bearing leader sequences (i.e., pre- 
peptides). Several possible vector systems are available for the expression of hOAT 
polypeptides of the invention in a mammalian host. 

A wide variety of transcriptional and translational regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 

30 regulatory signals may be derived from viral sources, such as adenovirus, bovine 

papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals 
are associated with a particular gene sequence which has a high level of expression. 
Alternatively, promoters from mammalian expression products, such as actin, collagen, 
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myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
5 are subject to chemical (such as metabolite) regulation. 

Expression of hOAT polypeptides of the invention in eukaryotic hosts requires the 
use of eukaryotic regulatory regions. Such regions will, in general, include a promoter 
region sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters 
include, for example, the promoter from the immediate-early gene of human 

10 cytomegalovirus (CMV) (Boshart M. et al., (1985) Cell 2:521-30, the promoter of the 
mouse metallothionein I gene sequence (Hamer et aL 9 (1982) J. MoL AppL Gen. 1:273- 
288); the TK promoter of Heipes virus (McKnight, (1982) Cell 31 :355-365); the SV40 
early promoter (Benoist et aL 9 (1981) Nature (London) 290:304-31); and the yeast gal4 
gene sequence promoter (Johnston et al y (1982) Proc. Natl Acad. ScL (USA) 79:6971- 

15 6975; Silver et al. y (1984) Proc. Natl. Acad. ScL (USA) 81:5951-5955). 

Translation of eukaryotic mRNA is initiated at the codon which encodes the first 
methionine. For this reason, it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes an hOAT polypeptide of the 
invention (or a functional derivative thereof) does not contain any intervening codons 

20 which are capable of encoding a methionine (i.e., AUG). The presence of such codons 

results either in the formation of a fusion protein (if the AUG codon is in the same reading 
frame as the hOAT polypeptide of the invention coding sequence) or a frame-shift 
mutation (if the AUG codon is not in the same reading frame as the hOAT polypeptide of 
the invention coding sequence). 

25 A nucleic acid molecule encoding an hOAT polypeptide of the invention and an 

operably linked promoter may be introduced into a recipient prokaryotic or eukaryotic cell 
either as a nonreplicating DNA or RNA molecule, which may either be a linear molecule 
or, more preferably, a closed covalent circular molecule. Since such molecules are 
incapable of autonomous replication, the expression of the gene may occur through the 

30 transient expression of the introduced sequence. Alternatively, permanent expression may 
occur through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
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introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g. 9 
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 
5 sequence can either be directly linked to the DNA gene sequences to be expressed, or 

introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 
transcription promoters, enhancers, and termination signals. cDNA expression vectors 
incorporating such elements include those described by Okayama (1983, MoL Cell. Biol. 
10 3:280-289). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 

15 may be recognized and selected from those recipient cells which do not contain the vector; 
the number of copies of the vector which are desired in a particular host; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of replication 
in E. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184; "Molecular 

20 Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pC194, pC221, 
pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, 
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pi J101 (Kendall et al. 9 
(1987) J. Bacterid. 169:4177-4183), Pseudomonas plasmids are reviewed in John et al. 
(1986) Rev. Infect. Dis. 8:693-704), and Izaki (1978) J. Bacteriol. 33:729-742). 

25 Preferred eukaryotic plasmids include, for example, pcDNA3.1 (Invitrogen), BPV, 

vaccinia, SV40, 2-micron circle, and the like, or their derivatives. Such plasmids are well 
known in the art (Botstein et al. 3 (1982) Miami Wntr. Symp. 19:265-274; Broach, In: 'The 
Molecular Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, (1982) Cell 

30 28:203-204; Bollon et al. y (1980J J. Clin. Hematol Oncol. 10:39-48; Maniatis, In: Cell 
Biology: A Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, 
NY, pp. 563-608, 1980). 
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Once the vector or nucleic acid molecule containing the constructs) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate host 
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 
5 precipitation, direct microinjection, and the like. After the introduction of the vector, 
recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of an hOAT 
polypeptide of the invention, or fragments thereof. This can take place in the transformed 
cells as such, or following the induction of these cells to differentiate (for example, by 

10 administration of bromodeoxyuracil to neuroblastoma cells or the like). A variety of 

incubation conditions can be used to form the peptide of the present invention. The most 
preferred conditions are those which mimic physiological conditions. 

Following the introduction of the vector, cells may be allowed to grow for 1-2 days 
in an enriched media before they are switched to selective media. The purpose of the 

15 selectable marker is to confer resistance to selection, and its presence allows growth and 

recovery of cells which successfully express the introduced sequences. Resistant clones of 
stably transformed cells may be proliferated using tissue culture technique appropriate to 
the cell type. 

Cell lines stably expressing human organic anion transporters under the control of 
20 an inducible promoter can also be isolated. For inducible expression of human organic 
anion transporters, the ecdysone system from Invitrogen (San Diego, CA) was used. 
Ecdysone is a hormone that regulates metamorphosis in insects, such as the fruitfly 
Drosophila. Like other steroid hormones, ecdysone exerts its effects by binding to a 
nuclear receptor, the VgR. The hormone-receptor complex then binds to a sequence of 
25 DNA called the ecdysone response element (EcRE), and activates transcription from 
promoters containing these elements. Since mammalian cells do not express ecdysone 
receptors, they normally do not respond to the insect hormone. However, if a piece of 
DNA under the control of an ecdysone inducible promoter is trans feet ed into a mammalian 
cell line, treating the cell line with ecdysone will induce expression of the said DNA. 
30 The cloned transporters can also be expressed in the amphibian oocytes, a widely 

used heterologous system for the expression of membrane proteins. Complementary RNA 
can be synthesized from purified cDNA in vitro if it is downstream of an appropriate 
promoter, such as the T3 or T7 phage promoter. The synthesized RNA are microinjected 
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into each frog oocyte, which translates the message and expresses the protein in its 
membranes. Xenopus oocytes can be obtained by surgical removal of ovaries from the 
* frog Xenopus laevis. The ovary lobes are treated with collagenase to remove the vitalin 
layer. Oocytes can be maintained for up to two weeks in Barth solution ( 88 mM NaCl, 1 
5 mM KC1, 2.4 mM NaHC0 3 , 15 mM HEPES (pH 7.6), 0.7 mM CaCl 2 , 0.82 mM MgS0 4 
and 50 pg/ml gentamicin) at 16°C. Typically, about 50 nl or 10-30 ng of RNA are 
injected into each oocyte. The injected oocytes are maintained at 16°C for 2-3 days and 
then subjected to transport assays. 
Assays for functional expression 

10 Assaying for functional expression of transporters can be done using a labeled 

compound that is a known substrate of the transporter. For hOATl,/?-aminohippurate 
(PAH) is a high affinity substrate that is rapidly transported. Transfected cells or injected 
oocytes are washed briefly with media or Barth buffer and then incubated with the labeled 
PAH. After the desired incubation time, the excess label is washed away by adding cold 

15 buffer. The cells are then solubilized with SDS and the amount of label taken up can be 
determined by scintillation counting. 

Other methods for measuring transport activity of organic anion transporters are 
well known in the art and are applicable to the present invention. For example, transport 
assays utilizing membrane vesicles was demonstrated by Yamazaki, et ah, (1997) Drug 

20 Metabolism and Disposition, 1 123-1 129; and Pascolo, et al. (1998) Biochem J.33 1:99- 103. 



Cell lines stably expressing hOAT polypeptides used to identify substrates, inhibitors 
and modulators of hOAT polypeptides 

Cell lines stably expressing hOAT polypeptides can be used to screen for other 

25 substrates or inhibitors of hOAT polypeptides through use of a competition assay. If a test 
compound is well transported by an hOAT protein, it will compete with a substrate, 
known to be transported by a particular hOAT, for access to the transporter. Cells 
expressing hOAT proteins would then uptake less of the known substrate in the presence 
than in the absence of the competitor. 

30 If a test compound competes with transport of a known substrate, further testing is 

necessary to confirm that the test compound is a substrate or inhibitor. Such confirmation 
can be done using techniques known to those skilled in the art, for example, the 
intracellular concentration of the test compound can be measured using HPLC. If a test 
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compound, which competes with transport of a known substrate, is also found to 
accumulate inside a cell, this confirms that the test substrate is a substrate and not an 
inhibitor of the particular hOAT being tested. 

Modulators can be identified by adding a test compound to cell lines stably 
5 expressing hOAT polypeptides and measuring whether the addition of the test compound 
increases or decreases the transport of hOAT substrates. If a modulator is identified, 
which decreases or inhibits the transport of a substrate, further testing, using techniques 
known to those skilled in the art, is necessary to confirm that such modulators are not 
actually competitive inhibitors. One such technique involves adding more substrate in a 
10 competition assay to determine if said substrate can compete with the inhibitory effects of 
the putative modulator. If adding more substrate has no affect on the modulating activity 
then the modulator is not a competitive inhibitor. 



Nucleic Acid Probes. Methods, and Kits for Detection of the hOAT Polypeptides 

15 A nucleic acid probe of the present invention may be used to probe an appropriate 

chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 

20 Sambrook, Fritsch, & Maniatis, eds., 1989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences which correspond to N-tenninal and C-terminal 
portions of the amino acid sequence of the polypeptide of interest. The synthesized 
nucleic acid probes may be used as primers in a polymerase chain reaction (PCR) carried 

25 out in accordance with recognized PCR techniques, essentially according to PCR 

Protocols, "A Guide to Methods and Applications", Academic Press, Michael, et a/., eds., 
1990, utilizing the appropriate chromosomal or cDNA library to obtain the fragment of the 
present invention. 

One skilled in the art can readily design such probes based on the sequence 
30 disclosed herein using methods of computer alignment and sequence analysis known in 
the art ( 4c Molecular Cloning: A Laboratory Manual", 1 989, supra). The hybridization 
probes of the present invention can be labeled by standard labeling techniques such as 
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with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 

The nucleic acid probes of the present invention include RNA, as well as DNA 
probes, such probes being generated using techniques known in the art. The nucleic acid 
5 probe may be immobilized on a solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 
art. 

10 The test samples suitable for nucleic acid probing methods of the present invention 

include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 
preparing nucleic acid extracts of cells are well known in the art and can be readily 

1 5 adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting the sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of the probe 
bound to the nucleic acid molecule. One skilled in the art would select the nucleic acid 

20 probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 
nucleic acid probe. The kit may further comprise other containers comprising one or more 

25 of the following: wash reagents and reagents capable of detecting the presence of bound 
nucleic acid probe. Examples of detection reagents include, but are not limited to 
radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline 
phosphatase), and affinity labeled probes (biotin, avidin, or streptavidin). Preferably, the 
kit further comprises instructions for use, 

30 

Antibodies, Hybridomas, Methods of Use and Kits for Detection of the hOAT 
Polypeptides 
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The present invention relates to an antibody having binding affinity to an hOAT 
polypeptide of the invention. The polypeptide may have the amino acid sequence encoded 
by the nucleic acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a functional derivative thereof, or the 
5 amino acid sequence set forth in SEQ ID NO:7 or SEQ ID NO:8» SEQ ID NO:9, SEQ ID 
NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, or a functional derivative thereof, or at least 9 
contiguous amino acids thereof (preferably, at least 20, 30, 40, 50 or 60 contiguous amino 
acids thereof). 

The present invention also relates to an antibody having specific binding affinity to 
10 an hOAT polypeptide of the invention. Such an antibody may be isolated by comparing 
its binding affinity to an hOAT polypeptide of the invention with its binding affinity to 
other polypeptides. Those which bind selectively to an hOAT polypeptide of the 
invention would be chosen for use in methods requiring a distinction between an hOAT 
polypeptide of the invention and other polypeptides, including other polypeptides with 
15 similar amino acid sequences. Such methods could include, but should not be limited to, 
the analysis of altered hOAT polypeptide expression in tissue containing other 
polypeptides. 

The hOAT polypeptides of the present invention can be used to produce antibodies 
or hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 

20 peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well fragments 
of these antibodies, and humanized forms. Humanized forms of the antibodies of the 
present invention may be generated using one of the procedures known in the art such as 
chimerization or CDR grafting. 

25 The present invention also relates to hybridomas that produce the above-described 

monoclonal antibodies, or binding fragment thereof. A hybridoma is an immortalized cell 
line that is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 

30 Techniques in Biochemistry and Molecular Biology/* Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et aL, (1980), J. Immunol. Methods 
35:1-21). Any animal (mouse, rabbit, and the like) which is known to produce antibodies 
can be immunized with the selected polypeptide. Methods for immunization are well 
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known in the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 
polypeptide and the site of injection. 
5 The polypeptide may be modified or administered in an adjuvant in order to 

increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or p-galactosidase) or through the inclusion of an 
adjuvant during immunization. 

1 0 For monoclonal antibodies, spleen cells from the immunized animals are removed, 

fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
the desired characteristics. These include screening the hybridomjts with an ELISA assay, 

1 5 western blot analysis, or radioimmunoassay (Lutz et al, ( 1 988) Exp. Cell Res. 175:1 09- 
124). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, (1984) "Monoclonal 
Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology", 
supra). 

20 For polyclonal antibodies, antibody-containing antisera is isolated from the 

immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 

25 as horse radish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Stemberger et al 9 
(1970) J. Histochem. Cytochem. 18:315; Bayer et aU (1979) Meth. Enzym. 62:308; Engval 
et al, (1972) Immunol 109:129; Goding, J. (1976) Immunol Meth. 13:215). The labeled 

30 antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 

39 



JSDOCID: <WO 0104283A2 I > 



WO 01/04283 PCT/US00/18980 

carbohydrates such as agarose and sepharose, acrylic resins such as polyacrylamide and 
latex beads. Techniques for coupling antibodies to such solid supports are well known in 
the art (Weir et aL 9 "Handbook of Experimental Immunology** 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et al 9 (1974) Meth. 
5 Enzym. 34, Academic Press, N.Y.,). The immobilized antibodies of the present invention 
can be used for in vitro, in vivo, and in situ assays as well as in immunochromotography. 

The present invention also encompasses a method of detecting an hOAT 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 

10 presence of said antibody bound to the polypeptide. In detail, the methods comprise 

incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. 

Conditions for incubating an antibody with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, 

15 and the type and nature of the antibody used in the assay. One skilled in the art will 

recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 
the present invention. Examples of such assays can be found in Chard ("An Introduction 

20 to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1 986), Bullock et aL ('Techniques in Immunocytochemistry," Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ('^Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1985). 

25 The immunological assay test samples of the present invention include cells, 

protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
mine* The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 

30 well known in the art and can readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
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above-described antibody, and (li) second container means containing a conjugate 
comprising a binding partner of the antibody and a label. Preferably, the kit also contains 
instructions for use. In another preferred embodiment, the kit further comprises one or 
more other containers comprising one or more of the following: wash reagents and 
5 reagents capable of detecting the presence of bound antibodies. 

Examples of detection reagents include, but are not limited to, labeled secondary 
antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
10 kits. One skilled in the art will readily recognize that the antibodies described in the 

present invention can readily be incorporated into one of the established kit formats that 

are well known in the art. 

Examples 

Example 1 : Identification of human EST clones homologous to rat organic 
1 5 anion transporter proteins 

The cloning of rat kidney organic anion transporter OAT1 (Sekine, et al. 9 (1997) J. 
Biol. Chem. 30: 18526-18529) and liver transporter OAT2 (Sekine, et al., (1998) FEBS 
Letters 429: 179-182) has been previously reported. The published amino acid sequences 
of OAT 1 and OAT2 from rat were used as input sequences to perform a BLAST search of 
20 human expressed sequence tags (EST) sequences deposited in Genbank. The BLAST 
search identified several sequences that show amino acid homology to rat OAT1 and 
OAT2. 

EST clones accession Nos. R25797 and AI0 16020 show high homology to rat 
OAT1 and OAT2, respectively. The clones isolated using EST clones R25797 and 
25 Al 01 6020 were named hOATl and hOAT2, respectively. The other ESTs show lower 
homology to rat OAT1 and OAT2. These other EST clones have been named hOAT3, 
hOAT4, and hOAT5. The latter three clones are not considered human homologues to the 
rat OAT1 and OAT2. 
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Example 2 : Isolation of full-length human OAT1 cDNA 
The human EST clone R25797, listed in Example 1, was purchased from a 
distributor of IMAGE Consortium (LLNL) cDNA Clones (Lennon, et aL, Genomics 33: 
151-52 (1996)). EST cDNA clone R25797 was sequenced to verify its homology to rat 
5 OAT1 . The cDNA insert was excised by Notl and HindUI digestion and purified by 
agarose gel electrophoresis. These DNA fragments were then labeled with P-a- 
deoxyadenosine triphosphate (3000 Ci/mmol) using the Prime-a-Gene labeling system 
(Promega, Madison, WI) to a specific activity greater than 1 x 10 9 dpm/jig. 

A human kidney cDNA library in pTripleEx vector (Clontech, Palo Alto, CA) was 

10 plated out on 10 cm-diameter plates at a density of 5 x 10 4 pfu/plate. The plates were 
incubated at 37°C for 6-8 hours until each plaque was about 0.3-0.5 mm in diameter. 
Plaque lifts were performed by placing a nylon membrane on each plate for 2 minutes. 
The nylon membrane was then lifted carefully from the plate and immersed in 1-5 M NaCl 
and 0.5 M NaOH for 2 minutes to denature the attached DNA. The nylon was then placed 

15 in 1 .5 M NaCl and 0.5 M Tris (pH 8.0) for neutralization. Finally, the membrane was 
quickly rinsed in 2X SSC (0.3 M NaCl and 30 mM sodium citrate). The attached DNA 
was crosslinked by UV irradiation. Filters were incubated in hybridization buffer with 
radioactive probes at a concentration of 1 x 10 6 dpm/ml in roller bottles at 65°C overnight 
The next day, the filters were washed with 2X SSC and 0.1% SDS for 15 minutes at 25°C 

20 for two times. They were then washed with 0.2X SSC and 0.1% SDS for 1 hour at 65°C. 
After the final wash, filters were exposed to autoradiographic film with an intensifying 
screen at -80°C overnight. Positive clones, which show up as dark spots on the autorad, 
were identified and a 3 mm- diameter agar plug was removed from the original plate. 

The positive clone was purified by subsequent rounds of hybridization until a well- 

25 isolated plaque could be obtained. The cDNA of interest was excised from the phage into 
a phagmid by an in vivo excision procedure using BM25.8 cells as host (Clontech, Palo 
Alto, CA). Phagmids were amplified, purified, and sequenced using standard procedures. 
The DNA sequence was analyzed for open reading frames and compared to published 
sequences. 

30 One positive plaque isolated from the above procedure was 2.5 kb in length. 

Analysis of its sequence revealed an open-reading frame with a deduced amino acid 
sequence showing high homology to the rat OAT1 protein (Table 1). This clone was 
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designated human OAT1 (hOATl), or the human homologue of rat OAT1. The entire 
cDNA and deduced amino acid sequences of human OAT1 are shown in Figure 1. 

The isolated cDNA encoding for hOATl is predicted to encode a polypeptide with 
a length of 550 amino acids. Computer analysis of the deduced amino acid encoded by 
hOATl cDNA reveals consensus protein kinase C (PKC) phosphorylation sites at amino 
acids 271,278, and 284. 

Computer analysis using the GAP- Alignment program from the Wisconsin 
Genetics Software Package indicates that hOATl polypeptide is 88% identical to rat 
OAT1 polypeptide and 39% identical to rat OAT2 polypeptide (Table 1). GAP analysis 
also indicates that hOATl is 86% identical to rat OAT1 nucleotide sequence (Table 1). 

EST accession number R25797 is homologous to hOATl cDNA at nucleotides 
398-2100 (Figure 1). 



Tablel 



Homology Among Human and Rat Organic Anion Transporters 





hOATl 
N 1 (A) 2 


hOAT2A 
N (A) 


hOAT2B 
N (A) 


hOAT3 
N (A) 


hOAT4 
N (A) 


hOATS 
N (A) 


rat OAT1 


86% (88%) 


53% (37%) 


53% (37%) 


60% (50%) 


48% (36%) 


51% (40%) 


ratOAT2 


53% (39%) 


82% (79%) 


82% (79%) 


51% (37%) 


44% (30%) 


46% (31%) 


hOATl 




54% (39%) 


55% (39%) 


61% (51%) 


49% (37%) 


50% (40%) 


hOAT2A 






99% (98%) 


52% (38%) 


44% (32%) 


45% (32%) 


hOAT2B 








52% (39%) 


44% (33%) 


45% (32%) 


hOAT3 










52% (37%) 


52% (39%) 


hOAT4 












72% (55%) 


hOAT5 















15 



20 



25 



1 (N) indicates the percentage of identical nucleotides found between compared 
nucleic 

acid sequences as determined using the GAP- Alignment program in the GCG 
software 

package, using a gap weight of 5.0 and a length weigh of 0.3. 

2 (A) indicates the percentage of identical amino acids found between compared 
polypeptides as determined using the GAP-Alignment program in the GCG 

software 

package, using a gap weight of 3.0 and a length weight of 0. 1 . 
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Example 3 : Isolation of full-length human OAT2A and OAT2B cDNA 
Human OAT2 cDNA was cloned in the same manner as described for human 
OAT1 cDNA in Example 1 except that EST accession number-A1016020 was used as a 
probe. In addition, hOAT2A and hOAT2B were isolated from a cDNA library obtained 
5 from human liver rather than kidney, as was hOATl. 

The entire cDNA and deduced amino acid sequences of human OAT2A are shown 
in Figure 2. Human OAT2A (hOAT2A) cDNA is predicted to encode a polypeptide that 
is 546 amino acids in length. Computer analysis of the deduced amino acid sequence 
using the GAP-Alignment program indicates that human OAT2A polypeptide is 79% 
10 identical to rat OAT2 polypeptide (Table 1). GAP analysis also indicates that the 
nucleotide sequence for hOAT2A is 82% identical to the nucleotide sequence of rat 
OAT2. 

EST accession number Al 01 6020 is homologous to hOAT2A cDNA at nucleotide 
sequences 20-1400 (Figure 2). 
15 The entire cDNA and deduced amino acid sequences of human OAT2B (hOAT2B) 

are shown in Figure 3. Human OAT2B cDNA is predicted to encode a polypeptide that is 
538 amino acids in length. Human OAT2B is identical to hOAT2A except at its C- 
terminus end. Without wishing to be bound by any particular theory, hOAT2B could be 
an mRNA splice variant of hOAT2A. 

20 

Example 4 : Isolation of full-length human OAT3 cDNA 
Human OAT3 (hOAT3) cDNA was isolated as described for hO ATI cDNA in 
Example 1 except that EST accession number-H41333 was used as a probe. The entire 
cDNA and deduced amino acid sequences of human OAT3 are shown in Figure 4. 
25 Computer analysis of the hOAT3 cDNA sequence predicts that hOAT3 cDNA encodes a 
polypeptide that is 542 amino acids in length. Computer analysis of the deduced amino 
acid sequence reveals consensus PKC phosphorylation sites at amino acids 259, 266, 269, 
511, and 527. 

EST accession number H41333 is homologous to hOAT3 cDNA at nucleotides 
30 1169-2121 (Figure 4). 
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Example 5 : Isolation of full-length human OAT4 cDNA 
Human OAT4 (hOAT4) cDNA was cloned in the same manner as described for 
human OAT1 cDNA in Example 1 except that EST accession number-AA705512 was 
used as a probe. In addition, hOAT4 cDNA was isolated from a cDNA library obtained 
from human liver rather than kidney, as was hOATL The entire cDNA and deduced 
amino acid sequences of human OAT4 are shown in Figure 5. Computer analysis of 
human OAT4 cDNA predicts that hOAT4 cDNA encodes a polypeptide that is 554 amino 
acids in length. Computer analysis of the deduced amino acid sequence reveals a 
consensus PKC phosphorylation site at amino acid 324. 

EST accession number AA705512 is homologous to hOAT4 cDNA at nucleotides 
1-1232 (Figure 5). 



Example 6 : Isolation of full-length hOAT5 cDNA 

Human OATS (hOAT5) cDNA was cloned in the same manner as described for 
human OAT1 cDNA in Example 1 except that EST accession number-AAA705512 was 
used as a probe. In addition, hOAT5 cDNA was isolated from a cDNA library obtained 
from human liver rather than kidney, as was hOATl . The entire cDNA and deduced 
amino acid sequences of human OATS are shown in Figure 6. Human OATS cDNA is 
predicted to encode a polypeptide that is 54 1 amino acids in length. Computer analysis of 
the deduced amino acid sequence reveals consensus PKC phosphorylation sites at amino 
acids 282, 289, 345, and 526. 

EST accession number AA705512 is homologous to hOATS cDNA at nucleotides 
1-1193 (Figure 6). 



Example 7 : Functional expression of human OAT1 in HeLa cells 
Human OAT1 cDNA was subcloned into the mammalian expression vector 
pcDNA3.1 (Invitrogen, San Diego, CA), which contains a cytomegalovirus promoter. The 
plasmid construct (pcDNA-hOATl) was then amplified and purified. The construct was 
transfected into HeLa cells (ATCC), cultured in 25 mm 2 flasks, using the Effectene 
transfection reagent (Qiagen, Stanford Santa Clarita, CA). Control cell lines were 
transfected with a pcDNA-lacZ construct. One day post-transfection, the cells were split 
into six (6)-well plates and cultured for another twenty-four (24) hours. Transfected cells 
were tested for transport of 3 H-/>-aminohippurate, a classical substrate of the kidney 
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organic anion transporter. Since OAT1 is an antiporter, the transfected cells were first 
preloaded with 1 mM glutarate for one (1) hour. The glutarate-containing media was 
removed and replaced with media containing 50 jiM PAH at 1 pCi/ml. After ten (10) 
minutes at 37°C, the media was aspirated and the cells were washed three times with ice 
5 cold phosphate-buffered saline (PBS). The cells were then solubilized with 0.5% SDS. 
The amount of labeled PAH transported into the cells was determined by a scintillation 
counter. As shown in Fig. 7, HeLa cells transfected with hOATl were able to uptake PAH 
whereas HeLa cells transfected with a vector control were only able to uptake a very small 
amount of PAH (almost 10-fold less than cells transfected with a vector containing DNA 
10 encoding hOATl). 

Example 8 : Functional expression of hOATl in Xenopus oocytes 
Complementary hOATl RNA was synthesized in vitro from the pcDNA-hOATl 
plasmid using T7 RNA polymerase (Ambion, Austin, TX). The transcription reaction was 
performed at 37°C for 2 hours. The RNA was purified by phenol and chloroform 
extraction and precipitated with ammonium acetate and ethanol. Purified RNA was 
analyzed for size and purity by agarose gel electrophoresis. RNA was resuspended in 
water at a concentration of 0.3 fig^l- Oocytes were prepared from ovaries removed from 
female Xenopus laevis according to White et al., (1985) PNAS 82:4852-56. About 15 ng 
of hOATl cRNA in 50 nl was injected into each oocyte. As a control, 50 nl of water was 
injected into each oocyte. The injected oocytes were assayed for transport of PAH two (2) 
days post-injection. Oocytes were preloaded with 1 mM glutarate for 2 hours in Barth 
buffer. Transport assays were conducted in modified Barth buffer (1 00 mM NaCl, 2mM 
KC1, 1 mM CaCl 2 , 1 mM MgCl 2 , 10 mM HEPES, pH 7.5) with 50 nM PAH at 4 ^Ci/ml. 
After a one (1) hour incubation at 25°C, oocytes were washed with 3 ml ice cold transport 
assay buffer three (3) times. Each oocyte was solubilized in 0.5 ml 0.5% SDS and 
subjected to scintillation counting. As shown in Fig. 8, oocytes injected with the hOATl 
RNA uptake 3 H-PAH to a much greater extent than water injected oocytes. 

30 Example 9 : Identification of fluorescein as a substrate for hOATl 

Human embryonic kidney (HEK293) cells were transfected with pcDNA-hOATl, 
as in Example 7. One day after transfection the cells were assayed for transport of 
fluorescein or SITS. Ten (10) ^M of either substrate was added to the external media for 
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ten (10) minutes at 37°C. The cells were washed with ice-cold PBS to remove excess 
label and analyzed under a fluorescent microscope. Cells that take up either fluorescein or 
SITS are predicted to turn fluorescent green. Fluorescent microscopy revealed that 
hO ATI -expressing cells took up fluorescein but not SETS. 



Example 10 : Competition assay to identify substrates of hOATl using 
fluorescein 

HEK293 cells were transiently transfected with pcDNA3-hOATl as in Example 7 
and assayed for transport of fluorescein, but in the presence or absence of 0.5 mM PAH. 
10 Since fluorescein and PAH are both substrates of hOATl, they will compete for access to 
the transporter. As evidenced by fluorescent microscopy, hOATl -expressing cells showed 
uptake of fluorescein in the absence but not in the presence of external PAH. 

Example 11 : Establishment of Cell lines stably expressing hOATs. 

15 Vector pIND(spl) (Invitrogen) contains the ecdysone-inducible promoter (EcRE). 

Using standard methods known to those in the art, cDNAs encoding hOATl, hOAT2A, 
into vector pIND/spl downstream of its EcRE promoter. After cloning, the vectors were 
separately transfected into cell line EcR293 (Invitrogen). The EcR293 cell line is a 
derivative of cell line HEK 293 and contains a stably integrated pVgRXR vector. The 

20 pVgRXR vector constitutively expresses the ecdysone receptor and the retinoid X receptor 
(RXR). The ecdysone receptor and RXR form a heterodimer which, upon activation by 
the hormone ecdysone, binds to a promoter containing the EcRE. The pVgRXR vector 
was maintained within the EcR293 cell line by selection with 400 tig/ml zeocin. The 
pIND(spl)-hOAT plasmids contain a neomycin resistance marker, and cells stably 

25 integrating the plasmids were selected for by culturing the cells in media supplemented 
with 400 fig/ml G418. After two weeks of selection, individual colonies appeared and 
were isolated and expanded. To test for expression of the transporter, the cells were 
induced with 1.25 iig/ml ponasterone, an ecdysone analogue, for twenty-four (24) hours. 
The cells were then subjected to standard transport assays. 

30 EcR293 cells with stably integrated pIND(spl)-hOATl expressed hOATl at a 

basal level that was not harmful to the cells. These cells grow at a rapid rate comparable 
to the parental EcR293 cells. Hence, the hOATl cDNA can be maintained within these 
cells indefinitely. Expression of hOATl at high levels can be easily achieved by inducing 
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with ponasterone for twenty- four (24) hours. As shown in Fig. 9, ECR293 cells 
containing hOATl in plasmid pIND(sp-l) uptake approximately 50-fold more PAH when 
induced by ponasterone than non-induced cells containing pIND(sp-l)-hOATl. ECR293 
cells stably containing hOAT2A, in plasmid pIND(sp-l), uptake approximately 2.6-fold 
5 more 3 H-PAH when induced by ponasterone than non-induced cells (Fig. 10). Cell lines 
stably expressing hOAT2B, hOAT3, hOAT4, and hOAT5 can be established using the 
same method as for hOATl and hOAT2A. 

Example 12 : Competition assay to identify new substrates of hOAT2, using a 

10 radiolabeled substrate 

EcR293/OAT2A cells were assayed for transport of 3 H-PAH as in Example 7. The 
uptake of 3 H-PAH was blocked by 100 jiM sulfobromophthalein (BSP) but not 500 \xM a- 
ketoglutarate (AKG) (Fig. 11). The competition assay thus identified BSP and not AKG 
as a preferred substrate of hOAT2A. 

15 

Example 13 : Predicting pharmacokinetics in man using a panel of hOAT 
expressing cells 

A panel of different cell lines, each expressing a particular human OAT protein, 
can be used to predict pharmacokinetics for anionic drugs. For instance, PAH is a 

20 compound transported rapidly by hOATl and to a much lesser extent by the other human 
OATs disclosed in the invention. The panel of hOAT expressing cells would predict that 
PAH is eliminated rapidly by the human kidney, as previously described. Similarly, the 
competition assays in Examples 10 and 12 showed that fluorescein is a very good 
substrate of human OAT1 and that BSP is a good substrate of human OAT2A. Therefore, 

25 the assay would predict that fluorescein is eliminated from the human body mainly via 
kidney excretion and that BSP is mainly eliminated from the human body via liver 
excretion. 

Further confirmation that a compound is a substrate of an hOAT would be done 
using in vivo tissue distribution assays. Compounds found to be transported by hOATs, 
30 using the cell line described above, would be injected into animals and followed to 

confirm their transport by the appropriate hOAT. For instance, a compound found to be 
transported by a cell line stably expressing hOATl, would be expected to be excreted in 
the urine of an animal injected with the said compound. Similarly, a compound found to 
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be transported by a cell line expressing hOAT2, would be expected to be transported into 
the liver of animals injected with said compound. 

Since hOAT4 and 5 have homology to OCTs, their range of of substrates may 
include organic cations. Therefore, these transporters may be used to predict the 
pharmacokinetics of certain cationic drugs as well, using the methods described herein. 



Example 14 : Identification of hOAT substrates using antiporter activity. 

EcR293/hOATl cells are incubated with H-a-ketoglutarate for an hour to preload 
with labeled ct-ketoglutarate. A test substrate is added to the cells. After a 10 minute 
1 0 incubation, an aliquot of the supernatant is removed and measured for effluxed a- 

ketoglutarate by scintillation counting. Substrates of hOATl will show trans-stimulation 
of ct-ketoglutarate exchange. Inhibitors or non-substrates of hOATl will not stimulate a- 
ketoglutarate efflux. 



15 Example 15 : Cytoxicity assays using cell lines stably expressing hOAT 

polypeptides 

EcR293 cells stably transformed with hOAT polypeptides can be used to screen for 
compounds potentially toxic to certain organs or tissues through use of a cytoxicity assay. 
Test compounds can be added to cell lines stably expressing hOAT polypeptides. 
20 Compounds transported by hOAT polypeptides and which are toxic will cause cell death. 
Cell death can be measured or determined using techniques well known in the art, such as 
visual inspection, microscopic inspection, and dye exclusion. Using dye exclusion, 
healthy cells are able to exclude a particular dye whereas injured or dead cells are unable 
to exclude the dye. 

25 The foregoing examples are not limiting and are merely representative of various 

aspects and features of the present invention. All references referred to above are 
incorporated herein by reference. 

Example 16 : Functional expression of hOAT3 in Xenopus oocytes 
Complementary hOAT3 RNA was synthesized in vitro from the pcDNA-hOAT3 
30 plasmid using T7 RNA polymerase (Ambion, Austin, TX). The transcription reaction was 
performed at 37°C for 2 hours. The RNA was purified by phenol and chloroform 
extraction and precipitated with ammonium acetate and ethanol. Purified RNA was 
analyzed for size and purity by agarose gel electrophoresis. RNA was resuspended in 
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water at a concentration of 0.5 |ig/|*l. Oocytes were prepared from ovaries removed from 
female Xenopus laevis according to White et al., (1985) PNAS 82:4852-56. About 25 ng 
of hOAT3 cRNA in 50 nl was injected into each oocyte. As a control, 50 nl of water was 
injected into each oocyte. The injected oocytes were assayed for transport of PAH two (2) 
days post-injection. Transport assays were conducted in modified Barth buffer (100 mM 
NaCl, 2mM KC1, 1 mM CaCl 2 , 1 mM MgCl 2 , 10 mM HEPES, pH 7.5) with 50 piM PAH 
at 10 nCi/ml. After a one (1) hour incubation at 25°C» oocytes were washed with 3 ml ice 
cold transport assay buffer three (3) times. Each oocyte was solubilized in 0.5 ml 0.5% 
SDS and subjected to scintillation counting. As shown in Fig. 12, oocytes injected with 
the hOAT3 RNA uptake 3 H-PAH to a much greater extent than water injected oocytes. 

Example 17 : Determination of tissue distribution of hOATs 
The individual hOAT cDNAs were labeled with 32 P-ATP to a specific activity of > 
1 x 10 9 dpm/fig using the Strip-EZ kit™ (Ambion). The Strip-EZ kit™ allows easy 
15 removal of probes from a blot and the blot can thus be reprobed several times without 
substantial loss of signal. The Human Multiple Tissue Northern blot was obtained from 
Clontech, which contained mRNA extracted from 12 different human tissues. The blot 
was successively hybridized with each hOAT probe in 5 ml of ExpressedHyb buffer 
(Clontech) at a final concentration of 3-5 x 10 6 dpm/ml. Hybridization was carried out at 
20 65°C for 4 hrs. The blot was washed two times with 2X SSC and 0.1% SDS at 65°C for 
1 5 min each, and then once with 0. IX SSC and 0. 1% SDS at 65°C for 1 hr. The blot was 
then exposed to BioMax MR autographic film (Kodak) with intensifying screen at -80°C 
for 24 hrs. 

The northern analysis shows that hOATl was expressed only in human 
25 kidney (FIG 13). The OAT1 probe hybridized to a major band at about 2.4 kb. 

The hOAT2 probe hybridized to two major bands at about 2.4 and about 3 kb from 
liver and kidney, with higher levels in the liver than kidney (FIG 13). HOAT3 was 
expressed only in the kidney, with a major transcript of about 2.4 kb and minor 
transcript at about 4.3 kb (FIG 13). HOAT4 and hOAT5 were expressed 
30 exclusively in the liver (FIG 1 3). hOAT4 probe hybridized to a major band at 

about 4.3 kb and hOAT5 probe hybridized to a major band at about 2,5 kb. Hence, 
northern analyses indicated that hOATl and 3 are mainly kidney transporters and 

i 
i 
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hOAT4 and 5 are liver transporters, whereas hOAT2 is a liver and kidney 
transporter. 

All patents and publications mentioned in the specification are indicative of the 
levels of skill of those skilled in the art to which the invention pertains. All references 
5 cited in this disclosure are incorporated by reference to the same extent as if each 
reference had been incorporated by reference in its entirety individually. None of the 
references are admitted to be prior art. 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
10 those inherent therein. The methods and compositions described herein as presently 
representative of preferred embodiments are exemplary and are not intended as limitations 
on the scope of the invention. Changes therein and other uses will occur to those skilled in 
the art, which are encompassed within the spirit of the invention, are defined by the scope 
of the claims. 

15 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. Thus, such additional embodiments are within the scope 
of the present invention and the following claims. 

The invention illustratively described herein suitably may be practiced in the absence 

20 of any element or elements, limitation or limitations which is not specifically disclosed 
herein. Thus, for example, in each instance herein any of the terms "comprising", 
"consisting essentially of* and "consisting of* may be replaced with either of the other 
two terms. The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that in the use of such terms and 

25 expressions of excluding any equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that although the present invention has 
been specifically disclosed by preferred embodiments, optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the art, 

30 and that such modifications and variations are considered to be within the scope of this 
invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in 
terms of Markush groups or other grouping of alternatives, those skilled in the art 
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will recognize that the invention is also thereby described in terms of any 
individual member or subgroup of members of the Markush group or other group. 

Other embodiments are within the following claims. 
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What is claimed is: 

1 . An isolated, enriched, or purified nucleic acid molecule comprising a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

2. An isolated, enriched, or purified nucleic acid molecule comprising a 
nucleic acid molecule which encodes a human OAT polypeptide selected from the group 
consisting of SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NOrl 1, and SEQ ID 
NO:12. 

3. An isolated, enriched, or purified nucleic acid molecule which is 
complementary to any of the nucleic acid molecules of claims I and 2. 

4. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
15 wherein said nucleic acid molecule is SEQ ID NO:2. 

5. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is SEQ ID NO:3. 

20 6. The isolated, enriched, or purified nucleic acid molecule of claim 1, 

wherein said nucleic acid molecule is SEQ ID NO:4. 

7. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is SEQ ID NO:5. 

25 

8. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is SEQ ID NO:6. 

9. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
30 wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:8. 

10. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:9. 

1 1 . The isolated, enriched, or purified nucleic acid molecule of claim 2, 
35 wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO: 10. 
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12. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:l 1. 



13. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
5 wherein said nucliec acid molecule encodes a polypeptide that is SEQ ID NO: 12. 

14. An isolated nucleic acid molecule comprising a nucleotide sequence 

selected from the group consisting of: 

(a) a nucleotide sequence which encodes a naturally occurring variant 
10 of a polypeptide comprising the amino acid sequence of SEQ ID NO:8, wherein the 

nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:2 under 

stringent conditions; 

(b) a nucleotide sequence which encodes a naturally occurring variant 

of a polypeptide comprising the amino acid sequence of SEQ ID NO:9, wherein the 
15 nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:3 under 

stringent conditions; 

(c) a nucleotide sequence which encodes a naturally occurring variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:10, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:4 under 

20 stringent conditions; 

(d) a nucleotide sequence which encodes a naturally occurring variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:l 1, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:5 under 

stringent conditions; and 
25 (e) a nucleotide sequence which encodes a naturally occurring variant 

of a polypeptide comprising the amino acid sequence of SEQ ID NO: 12, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:6 under 

r 

stringent conditions. 
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15. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 83% identity to nucleotide sequence contained in SEQ ID NO:2, wherein the 
percent identity is calculated using the GAP- Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

5 

16. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 83% identity to nucleotide sequence contained in SEQ ID NO:3, wherein the 
percent identity is calculated using the GAP- Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

10 

17. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 61% identity to nucleotide sequence contained in SEQ ID NO:4, wherein the 
percent identity is calculated using the GAP- Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

15 

18. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 60% identity to nucleotide sequence contained in SEQ ID NO: 5, wherein the 
percent identity is calculated using the GAP- Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

20 

19. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 60% identity to nucleotide sequence contained in SEQ ID NO:6, wherein the 
percent identity is calculated using the GAP -Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

25 

20. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 80% percent identical to the amino acid sequence 
of SEQ ID NO:8 or 9, wherein said percent identity is calculated using the GAP program 
in the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

30 
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21. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO: 10, wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

5 

22. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO:l 1 , wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

10 

23. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO: 12, wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

15 

24. The nucleic acid molecule of claim 1 or 2, further comprising a vector or 
promoter effective to initiate transcription in a host cell. 

25. The nucleic acid molecule of claim 24, wherein said promoter comprises an 
20 inducible promoter. 

26. A nucleic acid probe for the detection, isolation, purification, enrichment, 
or amplification of a nucleic acid molecule encoding human OAT in a sample, wherein 
said nucleic acid probes are selected from the group of nucleic acid sequences set forth in, 

25 SEQ ID NO:2 from nucleotide 1370- 1638, SEQ ID NO: 3 from nucleotide 1370-1614, 
SEQ ID NO:4: from nucleotide 1-1 107, SEQ ID NO: 5 from nucleotide 995-1662 and 
SEQ ID NO:6 from nucleotide 993-1623. 

27. A nucleic acid probe for the detection, isolation, purification, enrichment, 
30 or amplification of a nucleic acid molecule encoding human OAT in a sample, wherein 

said nucleic acid probes are selected from the group of nucleic acid sequences encoding 
the polypeptide fragment selected from the group consisting of amino acids 132-136 of 
SEQ ID NO:8, amino acids 132-136 of SEQ ID NO:9, amino acids 1 12-116 of SEQ ID 
NO:10, amino acids 136-140 of SEQ ID NO:l 1 and amino acids 135-139 of SEQ ID 
35 NO:12. 
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28. A recombinant cell comprising a nucleic acid molecule encoding a human 
OAT polypeptide wherein said nucleic acid molecule is selected from the group consisting 
of SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

5 

29. The cell of claim 28, wherein said cell is stably transformed. 

30. The cell of claim 28 or 29, wherein said cell is a mammalian cell. 

10 31. The cell of claim 28 or 29, wherein said polypeptide is a fragment of the 

polypeptide encoded by the amino acid sequence encoded by the nucleic acid sequence set 
forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or the 
amino acid sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:ll,or SEQIDNO:12. 



15 



20 



25 



30 



32. The cell of claim 28 or 29, wherein said nucleic acid is expressed from an 
inducible promoter. 

33. The cell of claim 32, wherein said promoter is inducible with ecdysone. 

34. An isolated, enriched, or purified human OAT polypeptide encoded by the 
nucleic acid sequence set forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, or SEQ ID NO:6, or the amino acid sequence set forth in SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO:10, SEQ ID NO:l 1, or SEQ ID NO:12. 

35. The polypeptide of claim 34, wherein said polypeptide is a fragment of the 
polypeptide encoded by the nucleic acid sequence set forth in SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or the amino acid sequence set 
forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, or SEQ ID NO:12, 



36. The polypeptide of claim 34, wherein said polypeptide is isolated, purified, 
or enriched from a cell that comprises an endogenous nucleic acid molecule that encodes 
said polypeptide. 
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37. The polypeptide of claim 34, wherein said polypeptide is isolated, purified, 
or enriched from a cell that is transformed with a nucleic acid molecule that encodes said 
polypeptide, 

5 38. The polypeptide of claim 34, wherein said polypeptide is chemically 

» 

synthesized. 

39. A labeled or unlabeled antibody or antibody fragment having specific- 
binding affinity to a human OAT polypeptide encoded by the nucleic acid sequence set 

10 forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or 
the amino acid sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ 
ID NO:l 1, or SEQ ID NO:12. 

40. A hybridoma which produces an antibody having specific binding affinity 
15 to a human OAT polypeptide encoded by the nucleic acid sequence set forth in SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or the amino acid 
sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, or 
SEQ ID NO: 12. 

20 41 . A method for identifying modulators of human OAT2A, human OAT2B, 

human OAT3, human OAT4, or human OATS polypeptide, said method comprising: 

(a) contacting said hOAT polypeptide with a test substance; 

(b) adding a substrate, 

(c) measuring the activity of said hOAT polypeptide; and 

25 (d) if said substance decreases transport, determining whether said test 

substance is a modulator or competitive inhibitor of said hOAT polypeptide. 

42. A method for identifying a modulator of human OAT2A, human OAT2B, 
human OAT3, human OAT4, or human OATS polypeptide in a cell, said method 
30 comprising: 

(a) expressing an hOAT polypeptide in a cell; 

(b) contacting said ceil with a test substance; 

(c) adding a substrate; 

(d) measuring the activity of said hOAT polypeptide; and 
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(e) if said substance decreases transport, determining whether said test 
substance is a modulator or competitive inhibitor of said hOAT. 



43. A method to screen for substrates or inhibitors of human OAT2A, human 
5 OAT2B, human OAT3, human OAT4, and human OATS using a cell stably transformed 

with an hOAT nucleic acid molecule and a competition assay, said method comprising: 

(a) adding a test substrate to a cell line expressing said human OAT 

polypeptides; 

(b) adding a substrate known to be transported by said human OAT 
10 polypeptides in said cell line; 

(c) measuring the ability of said test substrate to compete with the 
uptake of said substrate known to be transported by human OAT1, human OAT2A, human 
OAT2B, human OAT3, human OAT4, and human OAT5; and 

(d) confirming whether said test substrate is an inhibitor or substrate. 

15 

44. The method of claim 43, wherein said test substrate and/or known substrate 
are fluorescent or conjugated with a fluorescent labeled. 

45. The method of claim 44, wherein said assay is evaluated using high- 
20 throughput screening. 

46. A method for identifying substrates of human OAT polypeptides, said 
method comprising: 

(a) loading a cell line expressing an hOAT polypeptide with a 
25 substance know to be effiuxed from the interior of said cell as part of the antiporter 

activity of said hOAT polypeptide 

(b) adding a test substrate to said cell line; 

(c) measuring the efflux of said antiporter substance; and 

(d) determining if said test substrate affected the efflux of said 
30 antiporter substance. 

47. The cell of claim 28 or 29, wherein said cell is a yeast cell. 

48. The cell of claim 28 or 29, wherein said cell is an insect cell. 
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49. A method of preparing an hOAT polypeptide comprising: 

(a) culturing the recombinant cell of claim 29 under conditions that 
permit expression of the hOAT polypeptide; and 

(b) isolating said polypeptide. 

50. An amphibian oocyte containing a nucleic acid molecule encoding for a 
human OAT polypeptide according to claim 2. 

5 1 . The amphibian oocyte of claim 50 that is a Xenopus laevis oocyte. 

52. The nucleic acid molecules of claim 1 or 2, wherein said nucleic acid 
molecules comprise one or more regions that encode an hOAT polypeptide or an hOAT 
polypeptide fragment, where the hOAT polypeptide or the hOAT polypeptide fragment is 
fused to a non-hOAT polypeptide or epitope tag. 

53. The nucleic acid molecule of claim 52, wherein said non-hOAT 
polypeptide or epitope tag is selected from the group consisting of glutathione-S- 
transferase, green- fluorescent protein and an amino terminal tag composed of histidine 
residues. 



54. A method to identify toxic compounds transported by hOAT polypeptides 
comprising: 

(a) expressing an hOAT polypeptide in cells; 

(b) adding a test substrate to said cells; and 
25 (c) determining if said cells are viable. 
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I CTGAGCTGAeCTGACCCCCAAAGTGAAGGAGAAGCTGCAAGGGAAAAGGGAGGGACAGAT 

I 1 + + + + + + 60 

I GACTCGACTGGACTGGGGGTTTCACTTCCTCTTCGACGTTCCCTTTTCCCTCCCTGTCTA 

CAGGGAGACCGGGGAAGAAGGAGGAGCAGCCAAGGAGGCTGCTGTCCCCCCACAGAGCAG 
I 61 + + + -+ + + 120 

I GTCCCTCTGGCCCCTTCTTCCTCCTCGTCGGTTCCTCCGACGACAGGGGGGTGTCTCGTC 

I CTCGGACTCAGCTCCCGGAGCAACCCAGCTGCGGAGGCAACGGCAGTGCTGCTCCTCCAG 
Il21 + + + + + + 180 

I GAGCCTGAGTCGAGGGCCTCGTTGGGTCGACGCCTCCGTTGCCGTCACGACGAGGAGGTC 

I CGAAGGACAGCAGGCAGGCAGAGAGACAGAGGTCCTGGGACTGGAAGGCCTCAGCCCCCA 

1181 + + + + + + 240 

I GCTTCCTGTCGTCCGTCCGTCTGTCTGTCTCCAGGACCCTGACCTTCCGGAGTCGGGGGT 

I GCCACTGGGCTGGGCCTGGCCCAATGGCCTTTAATGACCTCCTGCAGCAGGTGGGGGGTG 
[241 + + + + + + 300 

J CGGTGACCCGACCCGGACCGGGTTACCGGAAATTACTGGAGGACGTCGTCCACCCCCXAC 

I MAFNDL L Q Q VGGV- 

I TCGGCCGCTTCCAGCAGATCCAGGTCACCCTGGTGGTCCTCCCCCTGCTCCTGATGGCTT 

hoi + + + + +- + 360 

J AGCCGGCGAAGGTCGTCTAGGTCCAGTGGGACCACCAGGAGGGGGACGAGGACTACCGAA 

GRFQQIQVTLVVLPLLLMAS- 

I CTCACAACACCCTGCAGAACTTCACTGCTGCCATCCCTACCCACCACTGCCGCCCGCCTG 
[361 + + + + + + 420 

I GAGTGTTGTGGGACGTCTTGAAGTGACGACGGTAGGGATGGGTGGTGACGGCGGGCGGAC 

HNTLQNFTAAI pthhcrppa- 

C C G ATGC CAAC C TCAGCAAGAA CGGGGGGCTGGAGGTCTGGCTGCC CCGGG ACAGGCAGG 
421 + + + + + + 480 

GGCTACGGTTGGAGTCGTTCTTGCCCCCCGACCTCCAGACCGACGGGGCCCTGTCCGTCC 
DANLSKNGGLEVWL P R D R Q G ~ 

GGCAGCCTGAGTCCTGCCTCCGCTTCACCTCCCCXX^AGTGGGGAC^ 

481 + +' + + + + 540 

CCGTCGGACTCAGGACGGAGGCGAAGTGGAGGGGCGTCACCCCTGACGGGAAAGAGTTAC 

I QPESCLRFTSPQWGLPFLN G - 

GCACAGAAGCCAATGGCACAGGGGCCACAGAGCCCTGCACCGATGGCTGGATCTATGACA 

541 + + + + + 600 

CGTGTCTTCGGTTACCGTGTCCCCGGTGTCTCGGGACGTGGCTACCGACCTAGATACTGT 

T EANGTGATEPCTDGW IYDN- 

ACAGCACCTTCCCATCTACCATCGTGACTGAGTGGGACCTTCTGTGCTCTCACAGGGCCC 
601 +-- + + + + + €60 
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V21 



781 



TCCTCTCGGGCATGGCTCTGGCTGGCATCTCCCTCAACTGCATGACACTGAATGTGGAGT 

841 + — -f + + + + + 

AGGAGAGCCCGT^CCGAGACCGACCGTAGAGGGAGTTGACGTACTGTGACTTACACCTCA 



TGTCGTGGAAGGqTAGATGGTAGCACTGACTCACCCTGGAACACACGAGAGTGTCCCGGG 



V T E W D L V 



H R A L - 



TACGCCAGCTGGCCCAGTCCTTGTACATGGTGGGGGTGCTGCTCGGAGCCATGGTGTTCG 



61 



720 



atgcggtcgaccgggtcaggaacatgtaccacccccacgacgagcctcggtaccacaagc 
rqlaIqslymvgvllgamvfg- 

I 

I 

GCTACCTTGCAG^CAGGCTAGGCCGCCGGAAGGTACTCATCTTGAACTACCTGCAGACAG 

+ L + + + + + 780 

CGATGGAACGTCTGTCCGATCCGGCGGCCTTCCATGAGTAGAACTTGATGGACGTCTGTC 



lad'rlgrrkvl 



LNYLQTA- 



ctgtgtcagggacctgcgcagccttcgcacccaacttccccatctactgcgccttccggc 

+—4 + + + + + 840 

GACACAGTCCCTCiGACGCGTCGGAAGCGTGGGTTGAAGGGGTAGATGACGCGGAAGGCCG 



GT'CAAFAPN 



R I* - 



900 



G M| A L A G 
I 



LNCMTLNVEW- 



!>0 



GGATGCCCATTC^CACACGGGCCTGCGTGGGCACCTTGATTGGCTATGTCTACAGCCTGG 

1 + — -j + + + + + 960 

CCTACGGGTAAGTGTGTGCCCGGACGCACCCGTGGAACTAACCGATACAGATGTCGGACC 



M 



HjTRACVGTLIG 
i 



V Y 



G - 



GCCAGTTCCTCCTGGCTGGTGTGGCCTACGCTGTGCCCCACTGGCGCCACCTGCAGCTAC 



j>61 



CGGTCAAGGAGGACCGACCACACCGGATGCGACACGGGGTGACCGCGGTGGACGTCGATG 



+ 1020 



I 



G V A Y A V 



H W R H 



L - 



) 



tggtctctgcgc<*:tttttttgccttctt^ 

21 + — t + + + + + 1080 

ACCAGAGACGCGdAAAAAAACGGAAGAAGTAGATGAGGACCAAGAAGTAACTCAGCCGGG 



V S 



FAFF 



Y S W F F 



E S A R - 



GCTGGCACTCCTCbCTCCGGGAGGCTGGACCTCACCCTGAGGGCCCTGCAGAGAGTCGCCC 

P81 ' + + + + + 1140 

CGACCGTGAGGAGGAGGCCCTCCGACCTGGAGTGGGACTCCCGGGACGTCTCTCAGCGGG 

r 

t 

i 

WHS S l SGRLDLTLRALQRVAR- 

i 

ggatcaatgggaAgcgggaagaaggagccaaattgagtatggaggtactccgggccagtc 



41 



+ 1200 
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CCTAGTTACCCTTCGCCCTTCTTCCTCGGTTTAACTCATACCTCCATGAGGCCCGGTCAG 
I NGKREEGAKL SMEVI*RASL- 

TGCAGAAGGAGCTGACCATGGGCAAAGGCCAGGCATCGGCCATGGAGCTGCTGCGCTGCC 
201 + + + + + + 1260 

ACGTCTTCCTCGACTGGTACCCGTTTCCGGTCCGTAGCCGGTACCTCGACGACGCGACGG 

QKELTMGKGQASAMEI/LRCP- 

CCACCCTCCGCCACCTCTTCCTCTGCCTCTCCATGCTGTGGTTTGCCACTAGCTTTGCAT 

261 + + + + + + 1320 

GGTGGGAGGCGGTGGAGAAGGAGACGGAGAGGTACGACACCAAACGGTGATCGAAACGTA 

TLRHLFLCLSMLWFATSFAY- 

ACTATGGGCTGGTCATGGACCTGCAGGGCTTTGGAGTCAGCATCTACCTAATCX!AGGTGA 

321 + + + + + + 1380 

TGATACCCGACCAGTACCTGGACGTCCCGAAACCTCAGTCGTAGATGGATTAGGTCCACT 

YGLVMDIiQGFGVS I Y L I Q V I - 

TCTTTGGTGCTGTGGACCTGCCTGCCAAGCTTGTGGGCTTCCTTGTCATCAACTCCCTGG 
81 + + + + + + 1440 

AGAAACCACGACACCTGGACGGACGGTTCGAACACCCGAAGGAACAGTAGTTGAGGGACC 
FGAVDI*PAKLVGFIjVINSLG- 



1 



GTCGCCGGCCTGCCCAGATGGCTGCACTGCTGCT 

141 + + + — + 1500 

CAGCGGCCGGACGGGTCTACCGACGTGACGACGACCGTCCGTAGACGTAGGACGAGTTAC 

RRPAQMAALLLAG ICILLNG- 



„. GGGTGATACCCCAGGACCAGTCCATTGTCCGAACCTCTCTTGCTGTGCTGGGGAAGGG 

ifeoi + + + + + + 1560 

CCCACTATGGGGTCCTGGTCAGGTAACAGGCTTGGAGAGAACGACACGACCCCTTCCCAA 

VI PQDQS IVRTSIiAVLGKGC- 

GTCTGGCTGCCTCCTTCAACTGC&TCTTCCTC 

561 + + + + + + 1620 

CAGACCGACGGAGGAAGTTGACGTAGAAGGACATATGACCCCTTGACATAGGGTGTTACT 

LAASFNC IFLYTGELYPTMI- 

TCCGGCAGACAGGCATGGGAATGGGCAGCACCATGGCCCGAGTGGGCAGCATCGTGAGCC 

521 + + + + + + 1680 

AGGCCGTCTGTCCGTACCCTTACCCGTCGTGGTACCGGGCTCACCCGTCGTAGCACTCGG 

RQT GMGMGSTMARVG S IVS P- 

CACTGGTGAGCATGACTGCCGAGCTCTACCCCTCCATGCCTCTCTTCATCTACGGTGCTC 
581 + ■+ + + + + 1740 
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GTGACCACTCGTACTGACGGCTCGAGATGGGGAGGTACGGAGAGAAGTAGATGCCACGAC 

LVSMTAELYPSMPLFIYGAV- 

TTCCTGTGGCCGCCAGCGCTGTCACTGTCCTCCTGCCAGAGACCCTGGGCCAGCCACTGC 

1741 + + + + + + 1800 

AAGGACACCGGCGGTCGCGACAGTGACAGGAGGACGGTCTCTGGGACCCGGTCGGTGACG 

PVAASAVTVIjL PETLGQPL P - 

CAGACACGGTGCAGGACCTGGAGAGCAGGAAAGGGAAACAGACGCGACAGCAACAAGAGC 

1801 + + + + + + 1860 

GTCTGTGCCACGTCCTGGACCTCTCGTCCTTTCCCTTTCTCTGCGCTGTCGTTGTTCTCG 

DTVQDLES RKGKQTRQQQEH- 

ACCAGAAGTATATGGTCCCACTGCAGGCCTCAGCACAAGAGAAGAATGGACTCTGAGGAC 

1861 + + + + ♦ + 1920 

TGGTCTTCATATACCAGGGTGACGTCCGGAGTCGTGTTCTCTTCTTACCTGAGACTCCTG 

QKYMVPLQASAQEKNGL* 

TGAGAAGGGGCCTTACAGAACCCTAAAGGGAGGGAAGGTCCTACAGGTCTCCGGCCACCC 

1921 + + + + + + 1980 

ACTCTTCCCCGGAATGTCTTGGGATTTCCCTCCCTTCCAGGATGTCCAGAGGCCGGTGGG 

ACACAAGGAGGAGGAAGAGGAAATGGTGACCCAAGTGTGGGGGTTGTGGTTCAGGAAAGC 

1981 + + + + + + 2040 

TGTGTTCCTCCTCCTTCTCCTTTACCACTC^^ 

ATCTTCCCAGGGGTC(^CCTCCCTTTATAAACCCCACCAGAACCACATCATTAAAAGGTT 

2041 + + + + + + 2100 

TAGAAGGGTCCCCAGGTGGAGGGAAATATTTGGGGTGGTCTTGGTGTAGTAATTTTCCAA 

TGACTGCGCACCAAAAAAAAAAAAAAA 

2101 + + 2127 

ACTGACGCGTGGTTTTTTTTTTTTTTT 
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M CTGCACCTGAAGCATTTGGTGGGTGAGCAGCATGGGCTTTGAGGAGCTGCTGGAGCAG6T 

1 + h + + + + 60 

GACGTGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV- 

GGGCGGCTTTGGGCCCTTCCAACTGCGGAATGrGGCACTGCTGGCCCTIXSCCCCGAGTGCT 

61 + + + + + + 120 

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGFGPFQLRNVALLALiPRVL - 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCX3TGCCTGCCCACCGATGTGC 

121 + + + + + + 180 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGCACXK3ACGGGTGGCTACACG 

LPIiHFLIiPIFLAAVPAHRCA- 

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 

L81 + + + + + + 240 

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

LPGA PANFSH QDVWIiEAHLP - 

CCGGGAGCCTGATGGC^CGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 
41 + + + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 



REPD. GTLSSCLRF AYPQALP- 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTC 
01 + + + + + + 360 

GTTGTGGTGCAACCCCCTTCTTTCCGTCTC 

NTTLGEERQSRGELEDEPAT- 

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC 

61 + + + + + + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSQ GWEYDHSEFSST I A T - 

TGAGTGGGATCTGGTGTGTGAGCAGAAAGGTCTGAACAGAGCTGCGTCCACTTTCTTC 

21 + + + + + + 480 

ACTCACCCTAGACCAC^CACTCGTCTTTCCAGACTTGTCTCGACGCAGGTGAAAGAAGAA 

EWDLVCEQK GLNRAASTFFF- 

CX3CCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTTTGGGCGGCG 
81 + + + + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC 
AGVLVGAVAFGYLSDRFGRR- 
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GCGTCTGCTGCTGGTAGCCTACGTGAGTACCCTGGTGCTGGGCCTGGCATCTGCAGCCTC 

541 + + + + + + 600 

CGCAGACGACGACCATCGGATGCACTCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

RLLLVAYVSTLVLGLASAAS - 

CGTCAGCTATGTAATGTTTGCCATCACCCGCACCCTTACTGGC^ 

501 + + + + + + 660 

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGACCAAA 



VSYVMFAITRTLTGSALAGF- 

TACCATCATCGTGATGCCACTGGAGCTGGAGTGGCTGGATGTGGAGCACCGCACCGTGGC 
561 + + + + + + 720 

ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 

T I IVMPLELEWLDVEHRTVA- 

TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 

721 + + + + + + 780 

ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACCGTGACCAACCCAT 

GVLS STFWTGGVMLLALVGY - 

CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 

781 + + + + + + 840 

GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

L IRDWRWLLLAVTLPCAPS I - 

CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTGGCTTCTGACCCAAGGCCATGTGAA 
341 + + + + + + 900 

GGAGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

LSLWWVPESARWLLTQGHVK- 

AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA 

)01 + + + + + + 960 

TCTCCGGGTGTCCATGAACGAGGTGACACGGTCCGAGTTACCCGCCGGTCACACACTCCT 

EAHRYLLHCARLNGRPVCED- 

GAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTC^ 

)61 + + + + + + 1020 

GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTO 

SFSQEAVSKVAAGERVVRRP- 

TTCATACCTAGACCTGTTCCGCACACCACGGCTCCGACACATCTCACTGTGCTGCG 

321 + + + + + + 1080 

AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTGTAGAGTGACACGACGCACCA 

SYLDLFRTPRLRH I SLCCVV- 



FIGURE 2 
Sheet 2 of 5 



6/34 



WO 01/04293 PCT/USO 0/1 8980 

GGTGTGGTTCGGAGTGAACTTCTCCTATTACGGCCTGAGTCTGGATGTGTCGGGGCTGGG 
l]p81 + + + + + + 1140 

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG- 

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGAACTGCCCTCCAAGCTGCT 
41 + + + + + + 1200 

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTTCGACGA 

LNVYQTQLLFGAVELPSKLL- 

GGTCTACTTGTCGGTGCGCTACGGAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG 
1#01 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYLSVRYAGRRLTQAGTL LG- 

CACGGCCCTGGCGTTCGGCACTAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC 

ll$61 + + + + + + 1320 

GTGCCGGGACCGCAAGCCGTGATCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG 

TAI/AFGTRLLVSSDMKSWST- 

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCAGCACTGCCTACCTGTT 

j21 + + + + + + 1380 

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGAAGTGGTGACGGATGGACAA 

VLAVMGKAFSEAAFTTAYLF- 

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT 

HB81 + + + + + + 1440 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

TSELYPTVLRQTGMGLTALV- 

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC 

1|&41 + + + + + + 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 

GRLGGSLAPLAALLDGVWLS- 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

501 + + + + + + 1560 

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

LPKLTYGGIALLAAGTALLL- 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGTGC 

561 + + + + + + 1620 

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGGTAGGTCCTGCACCTCTCTTTCTCACG 

PETRQAQLPETIQDVER KSA- 
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WO 01/04283 _ flICAAH 0 _ 
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CCCAACCAGTCTTCAGGAGGAAGAGATGCCCATGAAGCAGGTCCAGAACTAAGTGGGAGT 

1621 + + + + + + 1680 

GGGTTGGTCAGAAGTCCTCCTTCTCTACGGGTACTTCGTC 

PTSLQEEEMPMKQVQN* 

GGAGGCAGGCCCTCCACAGAAGCTCTGCAGCAGGGGCTGGGAGAGCAGAAGGGCAGGCCC 

1681 + + + + + + 1740 

CCTCCGTCCGGGAGGTGTCTTCGAGACGTCGTCCCCGACCCTCTCGTCTTCCCGTCCGGG 

TGCAACTCAGGCTGGGAGTATCGAACCCTCTGCCTAGGGCCGGAGTTGCTGCCAGTACCC 

1741 + + + + + + 1800 

ACGTTGAGTCCGACCCTCATAGCTTGGGAGACGGATCCCGGCCTCAACGACGGTCATGGG 

GCTCCCTCTGCTCATCCATCCTTGATTATTTGGCTTCTAGGAACAGTTGACTTCCCAGAA 

1801 + + -+ + + + 1860 

CGAGGGAGACGAGTAGGTAGGAACTAATAAACCGAAGATCCTTGTCAACTGAAGGGTCTT 

TGCAGTGGGOTGCTGGGCACCCCTCTCACGGTTGGGGAGGATTCTGTAAATAAAGGTGCC 

1861 + + + + + + 1920 

ACGTCACCCGACGACCCGTGGGGAGAGTGCCAACCCCTCCTAAGACATTTATTTCCACGG 

CCTTGGGTTGGGGCAATGGTGACGAGCTGTGGGAAGAGCCCTGGATAGGAAGCCACTGAG 

!921 + + + + + + I960 

GGAACCC^CCCCGTTACC^CTGCTCGACACCCTTCTCGGGACCTATCCTTCGGTGACTC 

TCTGCCCTGGGCTCTGATAAAACCTTCACCATTAACTTGCTGTGTGACCTTGGGCATGTG 

1981 + + + + + + 2040 

AGACGGGACCCGAGACTATTTTGGAAGTGGTAATTGAACGACACACTGGAACCCGTACAC 

GCTTTCCCTCTCTGGCCTCAGTCTGTTCATCTCCCAAATGGATAATGAAGCCTCTTGGGA 

2041 ♦ + + + + + 2100 

CGAAAGGGAGAGACCGGAGTCAGACAAGTAGAGGGTTTACCTATTACTTCGGAGAACCCT 

GGCCCTACCATAGGATCTGTTGCCATGCTCAAATGAGTTACTGAATAAGGTGCTTCTGCT 

2101 + + + + + + 2160 

CCGGGATGGTATCCTAGACAACGGTACGAGTTTACTCAATGACTTATTCCACGAAGACGA 

TCTTCTAGAGATGGTGCTAAAGAAAGGACTAGCATATGAGACTTCTGGTACCAATGGGGC 

2161 + + ► + + + 2220 

AGAAGATCTCTACCACGATTTCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCG 

TGGTGGGCATGCTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAA 

2221 + + + + + — + 22 80 

ACCACCCGTACGACZAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTT 



AAAGAACAGAGCTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGG 



2281 + +• 



•+ 2340 



TTTCTTGTCTCGAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCC 

CAATGCATGTC^TCCCAACCCCCAC^CTCCCCATCCTCCAACCCACTGGTCTCATG^ 

2341 + + + + + + 2400 

GTTACGTACAGTAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGT 
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AAGAAGAGTTGAAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATT 
2401 + + + + + + 2460 

TTCTTCTCAACTTCCGTACCCTCGGTTGTAAAATAACTTCTTCGGTGTCTCCGACTTTAA 

CAATAAACACAAGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

2461 + + + +- 2501 

GTTATTTGTGTTCAAAATACTCATTTTTTTTTTTTTTTTTT 



FIGURE 2 
Sheet 5 of 5 



9 / 34 



WO 01/04283 PCT/US00/18980 

CTGCACCTGAAGCATTTGGTGGGTGAGCAGCATGGGCTTTGAGGAGCTGCTGGAGCAGGT 
1 + + + + + + 60 

GACGTGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV- 

GGGCGGCTTTGGGCCCTTCCAACTGCGGAATGTGGCACTGCTGGCCCTGCCCCGAGTGCT 
51 + + + + + + 120 

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGFGPFQLRNVALLALP RVL- 

t 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCGTGCCTGCCCACCGATGTGC 

121 + + + + + + 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGCACGGACGGGTGGCTACACG 

LPLHFLLPIFLAAVPAHRCA- 

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 

•181 + + + + + + 240 

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

LFGAPANFSHQDVWLEAHLP- 

CCGGGAGCCTGATGGCACGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 

[241 + + + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 

REPDGTLSSCLRFAYPQALP- 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTGCCAC 

301 + + + + + + 360 

GTTGTGGTGCAACCCCCTTCTTTCCGTCTCGGCACCCCTCGACCTCCTACTTGGACGGTG 

NTTLGEERQSRGELEDEPAT- 

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC 

'361 + + + + —+ + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSQGWEYDHSEFSSTIAT- 

TGAGTGGGATCTGGTGTGTGAGCAGAAAGGTCTGAACAGAGCTGCGTCCACTTTCTTCTT 

421 + + ♦ + + + 480 

ACTCACCCTAGACCACACACTCGTCTTTCCAGACTTGTCTCGACGCAGGTGT^AAGAAGAA 

EWDLVCEQKGLNRAASTFFF - 

CGCCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTTTGGGCGGCG 

:481 + + + + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC 

AGVLVGAVAFGYLSDRFGRR- 
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j 

GCGTCTGCTGCTGGTAGCCTACGTGAGTACCCTGGTGCTGGGCCTGGCATCTGCAGCCTC 

1541 ■+ + + + + + 600 

! CGCAGACGACGACCATCGGATGCACTCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

; R L L LVAYVS T LV'LG LA SAA S - 



CGTCAGCTATGT AATGTTTGCCATCACCCGCACCCTT ACTGGCTCAGCCCTGGCTGGTTT 

1601 + + + + + + 660 

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGACCAAA 

i: 

| : VSYVMFAITRTLTGSALAGF- 
j - TACC ATCATCGTGATGCCACTGGAGCTGG AGTGGCTGGATGTGGAGCACCGCACCGTGGC 

feei — + + + + + + 720 

( ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 

i 

J f TIIVMPLELEWLDVEHRT VA- 



! ; TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 

1721 + + + + + ■ — + 780 

! ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACGGTGACCAACCCAT 

>; 

j; GVLSSTFWTGGV MtiLALVGY 

t; 

| \ CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 

[781 -i- + + + + + 840 

j ! GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

j; L I RDWRWLLLAVTLPCAP.S I 

; ■ 
t ■ 

j ! CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTOGCTTCTGACCCAAGGCCA 

|841 + + + + + + 900 

; GG AGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

\'. 

U 

j; LSLWWVPESARWLLTQGHVK- 

v. 

j : AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA 

1901 + + + + + + 960 

j TCTCCGGGTGTCCATGAACGAGGTGACACGGTCCGAGTTACCCGCCGGTCACACACTCCT 

ji 

i 

EAHRYLLHCARLNGRPVC ED- 



r 



CAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTCCGAAGACC 

1961 + + + + + + 1020 

! GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTTGCCCACCAGGCTTCTGG 

| SFSQEAVSKVAAGERVVRRP- 



!i TTCATACCTAGACCTGTTCCGCACACCACGGCTCCGACACATCTCACTGTGCTGCGTGGT 



ID21 + + + + + 

! AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTCTAGAGTGACACGACGCACCA 

! 
I 

SYLDLFRTPRLRHI SLCCVV- 

f 
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GGTGTGGTTCGGAGTGAACTTCTCCTATTACGGCCTGAGTCTGGATGTGTCGGGGCTGGG 

1081 + ; + + " + + + 1140 

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG- 

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGAACTGCCCTCCAAGCTGCT 

1141 + + + + + + 1200 

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTTCGACGA 

LNVYQTQLLFGAVELPSKLL- 

GGTCTACTTGTCGGTGCGCTACGCAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG 

1201 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYL.SVRYAGRRLTQAGTLLG- 

CACGGCCCTGGCGTTCGGCACTAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC . 

1261 + + + + + + 1320 

GTGCCGGGACCGCAAGCCGTGATCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG 

TALAFGTRLLVSSDMKSWST- 

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCACCACTGCCTACCTGTT 

1321 + + + + + 1 138° 

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGAAGTGGTGACGGATGGACAA 

VLAVMGKAF SEAAFTTAYLF 

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT 

1381 + + + + + "+ 1^40 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

TSELYPTVLRQTGM GLTALV- 

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC 

1441 + + + + + + 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 

GRLGG SLAPLAALLDGVWLS - 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

1501 + + + + + + 1560 

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

LPKLTYGGIALLAAGTALLL - 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGAGA 

1561 + + + + + + 1620 

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGGTAGGTCCTGCACCTCTCTTTCTCTCT 

PETRQAQLPETIQDVERKRD- 
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BNSDOCIO: <WO Q104283A2 I > 



WO 01/04283 



r 

PCTAJSOO/18980 



TGGTGCTAAAGAAAGGACTAGCATATGAGACTTCTGGTACCAATGGGGCTGGTGGGCATG 
1621 + + + + + + 1680 

ACCACGATTTCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCGACCACCCGTAC 
GAKERTS I * 

CTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAAAAAGAACAGAG 

1681 + + + + + + 1740 

GACAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTTTTTCTTGTCTC 

CTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGGCAATGCATGTC 

1741 + + + + + + 1800 

GAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCCGTTACGTACAG 

ATCCCAACCCCCACACTCCCCATCCTCCAACCCACTGGTCTCATGCCCAAAGAAGAGTTG 

1801 + + + + + + 1860 

TAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGTTTCTTCTCAAC 

AAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATTCAATAAACACA 

1861 + + + + + • + 1920 

TTCCGTACCCTCGGTTGTAAAATAACTTCTTCGGTGTCTCCGACTTTAAGTTATTTGTGT 

AGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

1921 + + + 1950 

TCAAAATACTCATTTTTTTTTTTTTTTTTT 
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nJSDOCID: <WO 01 04283 A2_l_> 



WO 01/04283 PCT/US00/18980 

GCAGGGACCTCAACTACACTGATCACCAGCCCCATCGGATCCAGACCCGGCCACCAGTGC 

! „; + + + - + + + 60 

CGTCCCTGGAGTTGATGTGACTAGTGGTCGGGGTAGCCTAGGTCTGGGCCGGTGGTCACG 

CATGACCTTCTCGGAGATCCTGGACCGTGTGGGAAGCATGGGCCATTTCCAGTTCCTGCA 

61 + + + + + + 120 

GTACTGGAAGAGCCTCTAGGACCTGGCACACCCTTCGTACCCGGTAAAGGTCAAGGACGT 

MTFSEILDRVGSMGHFQFLH- 

TGTAGCCATACTGGGCCTCCCGATCCTCAACATGGCCAACCACAACCTGCTGCAGATCTT 

12 1 + + -+ + + + 180 

ACATCGGTATGACCCGGAGGGCTAGGAGTTGTACCGGTTGGTGTTGGACGACGTCTAGAA 

VAILGLPILNMANHNLLQIF- 

CACAGCCGCCACCCCTGTCCACCACTGTCGCCCGCCCCACAATGCCTCCACAGGGCCTTG 

18 l + + + + + + 240 

GTGTCGGCGGTGGGGACAGGTGGTGACAGCGGGCGGGGTGTTACGGAGGTGTCCCGGAAC 

TAAT PVHHCRPPHNAS TGPW- 

GGTGCTCCCCATGGGCCCAAATGGGAAGCCTGAGAGGTGCCTCCGTTTTGTACATCCGCC 

241 + + + + + + 300 

CCACGAGGGGTACCCGGGTTTACCCTTCGGACTCTCCACGGAGGCAAAACATGTAGGCGG 

VLPMGPNGKPERCLRFVHPP- 

CAATGCCAGCCTGCCCAATGACACCCAGAGGGCCATGGAGCCATGCCTGGATGGCTGGGT 

301 + + + + + + 360 

GTTACGGTCGGACGGGTTACTGTGGGTCTCCCGGTACCTCGGTACGGACCTACCGACCCA 

NASLPNDTQRAMEPCLDGWV- 

CTACAACAGCACCAAGGACTCCATTGTGACAGAGTGGGACTTGGTGTGCAACTCCAACAA 

361 + + + + + + 420 

GATGTTGTCGTGGTTCCTGAGGTAACACTGTCTCACCCTGAACCACACGTTGAGGTTGTT 

YNSTKDSIVTEWDtiVCNSNK- 

ACTGAAGGAGATGGCCCAGTCTATCTTCATGGCAGGTATACTGATTGGAGGGCTCGTGCT 

421 + + + + + + 480 

TGACTTCCTCTACCGGGTCAGATAGAAGTACCGTCCATATGACTAACCTCCCGAGCACGA 

LKEMAQSIFMAGILIGGLVL- 

TGGAGACCTGTCTGACAGGTTTGGCCGCAGGCCCATCCTGACCTGCAGCTACCTGCTGCT 

481 + + + + + + 540 

ACCTCTGGACAGACTGTCCAAACCGGCGTCCGGGTAGGACTGGACGTCGATGGACGACGA 

GDLSDRFGRRPILTCSYIiLIi- 

GGCAGCCAGCGGCTCCGGTGCAGCCTTCAGCCCCACCTTCCCCATCTACATGGTCTTCCG 
541 + + + + + + 600 
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3 NS DOC ID: <WO 0104283A2_I_> 



WO 01/04283 PCT/US00/1898O 

CCGTCGGTCGCCGAGGCCACGTCGGAAGTCGGGGTGGAAGGGGTAGATGTACCAGAAGGC 

AASGSGAAFSPTFPIYMVFR- 

CTTCCTGTGTGGCTTTGGCATCTCAGGCATTACCCTGAGCACCGTCATCTTGAATGTGGA 
601 + + + + + + 660 

GAAGGACACACCGAAACCGTAGAGTCCGTAATGGGACTCGTGGCAGTAGAACTTACACCT 

FLCGFGISGITLSTVILNVE- 

ATGGGTGCCTACCCGGATGCGGGCCATCATGTCGACAGCACTCGGGTACTGCTACACCTT 
661 + + + + + + 720 

TACCCACGGATGGGCCTACGCCCGGTAGTACAGCTGTCGTGAGCCCATGACGATGTGGAA 

WVPTRMRAIMSTALGYCYTF 

TGGCCAGTTCATTCTGCCCGGCCTGGCCTACGCCATCCCCCAGTGGCGTTGGCTGCAGTT 
721 + + + + + + 780 

ACCGGTCAAGTAAGACGGGCCGGACCGGATGCGGTAGGGGGTCACCGCAACCGACGTCAA 
GQFI LPGLAY AI PQWRWLQL- 

AACTGTGTCCATTCCCTTCTTCGTCTTCTTCCTATCATCCTGGTGGACACCAGAGTCCAT 
781 + + + + + + 840 

TTGACACAGGTAAGGGAAGAAGCAGAAGAAGGATAGTAGGACCACCTGTGGTCTCAGGTA 

TVSIPFFVFFLSSWWTPESI - 

ACGCTGGTTGGTCTTGTCTGGAAAGTCCTCGAAGGCCCTGAAGATACTCCGGCGGGTGGC 

841 + + + + + + 900 

TGCGACCAACCAGAACAGACCTTTCAGGAGCTTCCGGGACTTCTATGAGGCCGCCCACCG 

RWLVLSGKSSKALKILRRVA- 

TGTCTVrCAATGGCAAGAAGGAAGAGGGAGAAAGGCTCAGCTTGGAGGAGCTCAAACTCAA 
901 + + + + + + 960 

ACAGAAGTTACCGTTCTTCCTTCTCCCTCTTTCCGAGTCGAACCTCCTCGAGTTTGAGTT 

VFNGKKEEGERLSLEELKLN- 

CCTGCAGAAGGAGATCTCCTTGGCCAAGGCCAAGTACACCGCAAGTGACCTGTTCCGGAT 

961 + + + + + + 1020 

GGACGTCTTCCTCTAGAGGAACCGGTTCCGGTTCATGTGGCGTTCACTGGACAAGGCCTA 

LQKE1 SLAKAKYTASDLFRI - 

ACCCATGCTGCGCCGCATGACCTTCTGTCTTTCCCTGGCCTGGTTTGCTACCGGT^ 

1021 + + + + + + 1080 

TGGGTACGACGCGGCGTACTGGAAGACAGAAAGGGACCGGACCAAACGATGGCCAAAACG 

PMLRRMTFCLSLAWFATGFA 

CTACTATAGTTTGGCTATGGGTGTGGAAGAATTTGGAGTCAACCTCTACATCCTCCAGAT 
1081 + + + + + . + 1140 
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JSDOCID: <WO 01 04283A2_I_> 



WO 01/04283 PCT/USOO/18980 

GATGATATCAAACCGATACCCACACCTTCn , TAAACCTCAGTTGGAGATGTAGGAGGTCTA 
YVSLAMGVEEFGVNLY I LQI - 

CATCTTTGGTGGGGTCGATGTCCCAGCCAAGTTCATCACCATCCTCTCCTTAAGCTACCT 

lb-41 - + + + + + + 1200 

GTAGAAACCACCCCAGCTACAGGGTCGGTTCAAGTAGTGGTAGGAGAGGAATTCGATGGA 



1 



IFGGVDVPAKFITILSLSYL- 

GGGCCGGCATACCACTCAGGCCGCTGCCCTGCTCCTGGCAGGAGGGGCCATCTTGGCTCT 

ilfeOl + : — + + + + + 1260 

CCCGGCCGTATGGTGAGTCCGGCGACGGGACGAGGACCGTCCTCCCCGGTAGAACCGAGA 

GRHTTQAAALLLAGGAILAL- 

CACCTTTGTGCCCTTGGACTTGCAGACCGTGAGGACAGTA'TTGGCTGTGTTTGGGAAGGG 

iksi + + + + + + 1320 

GTGGAAACACGGGAACCTGAACGTCTGGCACTCCTGTCATAACCGACACAAACCCTTCCC 

TFVPLDLQTVRTVLAVFGKG- 

ATGCCTATCCAGCTCCTTCAGCTGCCTCTTCCTCTACACAAGTGAATTATACCCCACAGT 

01&21 + + + + + + 1380 

TACGGATAGGTCGAGGAAGTCGACGGAGAAGGAGATGTGTTCACTTAATATGGGGTGTCA 

CLSSSFSCLFLYTSELYPTV- 

CATCAGGCAAACAGGTATGGGCGTAAGTAACCTGTGGACCCGCGTGGGAAGCATGGTGTC 

381 + + + + + + 1440 

GTAGTCCGTTTGTCCATACCCGCATTCATTGGACACCTGGGCGCACCCTTCGTACCACAG 

IRQTGMGVSNLWTRVGSMVS - 

CCCGCTGGTGAAAATCACGGGTGAGGTACAGCCCTTCATCCCCAATATCATCTACGGGAT 

1B41 + + + + + :+ I 500 

GGGCGACCACTTTTAGTGCCCACTCCATGTCGGGAAGTAGGGGTTATAGTAGATGCCCTA 

PLVKITGEVQPFI PNI IYG I 
CACCGCCCTCCTCGGGGGCAGTGCTGCCCTCTTCCTGCCTGAGACCCTGAATCAGCCCTT 

ihoi + + + + + + 1560 

GTGGCGGGAGGAGCCCCCGTCACGACGGGAGAAGGACGGACTCTGGGACTTAGTCGGGAA 

TALLGGSAALFLPETLNQPL- 

GCCAGAGACTATCGAAGACCTGGAAAACTGGTCCCTGCGGGCAAAGAAGCCAAAGCAGGA 

+ + + + + + 1620 

CGGTCTCTGATAGCTTCTGGACCTTTTGACCAGGGACGCCCGTTTCTTCGGTTTCGTCCT 

PETIEDLE NWSLRAKKPKQE- 

GCCAGAGGTGGAAAAGGCCTCCCAGAGGATCCCTCTACAGCCTCACGGACCAGGCCTGGG 



j 521 



+ 1680 
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BNSDOC1D: <WO 0104283A2 I > 



WO 01/042S3 

PC TYUSO 0/18980 

CGGTCTCCACCTTTTCCGGAGGGTCTCCTAGGGAGATGTCGGAGTGCCTGGTCCGGACCC 

PEVEKASQRI PLQPHGPGLG- 

CTCCAGCTGAGGACAACGGAACCCCCTTTCCCTGCCCTCCAGAGACTGATCCTAGCCAGG 
3#81 + + + + + + 1740 

GAGGTCGACTCCTGTTGCCTTGGGGGAAAGGGACGGGAGGTCTCTGACTAGGATCGGTCC 



CACCTTAGGAGTATAGGGAGGCCCCATATAGGTCCATCCTCCTAGGATGAAGCCTTCTGA 

lfp41 + + + + + + 1800 

GTGGAATCCTCATATCCCTCCGGGGTATATCCAGGTAGGAGGATCCTACTTCGGAAGACT 

GAGCTTGGTGAAGGTGTCTCCATCACCACCACCAGAGCCTCCTGCCCAGCCCTGGCCAGT 
$01 + + + + + + I860 

CTCGAACCACTTCCACAGAGGTAGTGGTGGTGGTCTCGGAGGACGGGTCGGGACCGGTCA 

TCAAAGGTTCAAGCCATCCCTGCCCTTGTTCTCCCTGCAACCCAAGCCCTGCCATTCTTC 

1#61 + + + + + + 1920 

AGTTTCCAAGTTCGGTAGGGACGGGAACAAGAGGGACGTTGGGTTCGGGACGGTAAGAAG 

TGTCTAGCCCTTCCCCACTGGCCAACTTCCCCCACTGTCCCGGTCCTCTTCCCCTGAGGT 

l|p21 + + + + + + 1980 

ACAGATCGGGAAGGGGTGACCGGTTGAAGGGGGTGACAGGGCCAGGAGAAGGGGACTCCA 

CCCCTGATATCCCCTGGCTCAGTCCTAACAAGACTGAGTCTTAACAAGATGAGAAGTCCT 

11$ 81 + + + + + + 2040 

GGGGACTATAGGGGACCGAGTCAGGATTGTTCTGACTCAGAAT^ 

CCCCTTCTTGCCTCCCACACTTTTCTTTGATGGGAGGTTTCAATAAACAGCGATAAGAAC 

21041 + + + + + + 2100 

GGGGAAGAACGGAGGGTGTGAAAAGAAACTACCCTCCAAAGTTATTTGTCGCTATTCTTG 

TCTAAAAAAAAAAAAAAAAAA 

2^01 + +- 2121 

AGATTTTTTTTTTTTTTTTTT 
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WO 01/04283 PCT/US00/18980 

CAAAT^ATTTCTTACGTGACTTTAGAGAAAACGGCTACCTATCTGACCCCAAAACGACTT 

1 + + + + + + 60 

GTTTAATAAAGAATGCACTGAAATCTCTTTTGCCGATGGATAGACTGGGGTTTTGCTGAA 

GAGGAAACTGTTTCCACGGTCCTGCTGCAGGGGGGAAGCACAGTCGTCAAGAAGAGAGTG 

61 + + + + + + 120 

CTCCTTTGACAAAGGTGCCAGGACGACGTCCCCCCTTCGTGTCAGCAGTTCTTCTCTCAC 

GGGTCAGGATCAAAACACATTTAGTGTGACTTAGGGAAAGAAAACATTTTCCCTCTTTGA 

L2 1 + + — + + + + 180 

CCCAGTCCTAGTTTTGTGTAAATCACACTGAATCCCTTTCTTTTGTAAAAGGGAGAAACT 

ACCTCTCTGGATACAGTCATTTTGCCTCTACTTGAGGATCAACTGTTCAACCTCAATGGC 

81 + + + + + + 240 

TGGAGAGACCTATGTCAGTAAAACGGAGATGAACTCCTAGTTGACAAGTTGGAGTTACCG 

M A - 

CTTTCAGGACCTCCTGGGTCACGCTGGTGACCTGTGGAGATTCCAGATCCTTCAGACTGT 

g 41 + + + + + + 300 

GAAAGTCCTGGAGGACCCAGTGCGACCACTGGACACCTCTAAGGTCTAGGAAGTCTGACA 

FQDLLGHAGDLWRFQILQTV- 

TTTTCTCTCAATCTTTGCTGTTGCTACATACCTTCATTTTATGCTGGAGAACTTCACTGC 

301 + + + + * + + 360 

AAAAGAGAGTTAGAAACGACAACGATGTATGGAAGTAAAATACGACCTCTTGAAGTGACG 

FIjS I FAVATYLHFMLENFTA- 

ATTCATACCTGGCCATCGCTGCTGGGTCCACATCCTGGACAATGACACTGTCTCTGACAA 

61 + + + + + + 420 

TAAGTATGGACCGGTAGCGACGACCCAGGTGTAGGACCTGTTACTGTGACAGAGACTGTT 

FIPGHRCWVHILDNDTVSDN- 

TGACACTGGGGCCCTCAGCCAAGATGCACTCTTGAGAATCTCCATCCCACTGGACTCAAA 

21 + + + + + + 480 

ACTGTGACCCCGGGAGTCGGTTCTACGTGAGAACTCTTAGAGGTAGGGTGACCTGAGTTT 

DTGAL SQDALLR I S I PLDSN- 

CATGAGGCCAGAGAAGTGTCGTCGCTTTGTTCATCCTCAGTGGCAGCTCCTTCACCTGAA 

ft81 + + + + +" + 540 

GTACTCCGGTCTCTTCACAGCAGCGAAACAAGTAGGAGTCACCGTCGAGGAAGTGGACTT 

MRPEKCRRFVHPQWQLLHLN- 

TGGGACCTTCCCCAACAGAAGTGACGCAGACATGGAGCC 

P 41 + + + + + + 600 

ACCCTGGAAGGGGTTGTGTTCACTGCGTCTGTACCTCGGGACACACCTACCGACCCACAT 

GTFPNTSDADMEPCVDGWVY- 
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BNSDOCID: <WO 01 04283A2_I_> 



WO 01/04283 

R 



PCT/USOO/18980 



TGACAGAATCTCCTTCTCATCCACCATCGGTGACCTGAAGTGGGATCTGGTATGTGACTC 
|E01 + + + + + + 660 

ACTGT CTT AGAGG AAG AGT AGGTGGTAG CC ACTGGACTTCACCCT AG ACC AT AC ACTGAG 
DRISFSSTIGDLKWDLVCDS- 

TCAATCACTGACTTCAGTGGCTAAATTTGTATTCATGGCTGGAATGATGTTGGGAGGCAT 
61 + + + + + + 720 

AGTTAGTGACTGAAGTCACCGATTTAAACATAAGTACCGACCTTACTACAACCCTCCGTA 

QSL TSVAKFVFMAGMMLGG I 

CTTAGGCGTTCATTTATCAGACAGGTTTGGGAGAAGTTTCGTGCTCAGATGGTGTTACCT 
21 + + + + + + 780 

GAATCCGCAAGTAAATAGTCTGTCCAAACCCTCTTCAAAGCACGAGTCTACCACAATGGA 

LG VHLSDRFGRSFVLRWCYL- 

CCAGGTTGCCATTGTTGGCACTTGTGCAGCGTTGGCTCCCACTTTCCTCATTTACTGCTC 

81 + + + + + + 840 

GGTCCAACGGTAACAACCGTGAACACGTCGCAACCGAGGGTGAAAGGAGTAAATGACGAG 

QVAIVGTCAALAPTFLIYCS- 

AGTACGCTTCTTGTCTGGGATTGCTGCAATGAGCTTCATAACAAATACTATTATGTTAAT 

41 + +- + '+ + + 900 

TCATGCGAAGAACAGACCCTAACGACGTTACTCGAAGTATTGTTTATGATAATACAATTA 



VRFLSGIAAM 



ITNTIMLI 



AGCCGAGTGGGCAACACACAGATTCCAGGCCATGGGAATTACATTGGGAATGTGCCCTTC 
01 + + + + + + 960 

TCGGCTCACCCGTTGTGTGTCTAAGGTCCGGTACCCTTAATGTAACCCTTACACGGGAAG 

AEWATHRFQAMGITLGMCPS- 

TGGTATTGCATTTATGACCCTGGCAGGCCTGGCTTTTGCCATTCGAGACTGGCATATCCT 
61 + + + + - + + 1020 

ACCATAACGTAAATACTGGGACCGTCCGGACCGAAAACGGTAAGCTCTGACCGTATAGGA 
GIAFMTLAGLAFAIRDWHIL- 



CCAGCTGGTGGTGTCTGTACCATACTTTGTGATCTTTCTGACCTCAAGTTGGCTGCTAGA 

321 + + + + + + 1080 

GGTCGACCACCACAGACATGGTATGAAACACTAGAAAGACTGGAGTTCAACCGACGATCT 

QLVVSVPYFVIFLTSSWLLE- 



GTCTGCTCGGTGGCTCATTATCAACAATAAACCAGAGGAAGGCTTAAAGGAACTTAGAAA 

WB1 + + + + + + 1140 

CAGACGAGCCACCGAGTAATAGTTGTTATTTGGTCTCCTTCCGAATTTCCTTGAATCT^ 



SARWL1 INNKPEEGLKELRK- 
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NSDOCID: <WO 0104283A2J_> 



WO 01/04283 PCT/US00/1 8980 

AGCTGCACACAGGAGTGGAATGAAGAATGCCAGAGACACCCTAACCCTGGAGATTTTGAA 

1141 + + + + + + 1200 

TCGACGTGTGTCCTCACCTTACTTCTTACGGTCTCTGTGGGATTGGGACCTCTAAAACTT 

AAHRSGMKNARDTLTLEILK- 

ATCCACCATGAAAAAAGAACTGGAGGCAGCACAAAAAAAAAAACCTTCTCTGTGTGAAAT 

1201 + + + + + + 1260 

TAGGTGGTACTTTTTTCTTGACCTCCGTCGTGTTTTTTTTTTTGGAAGAGACACACTTTA 

STMKKELEAAQKKKPSLCEM- 

GCTCCACATGCCCAACATATGTAAAAGGATCTCCCTCCTGTCCTTTACGAGATTTGCAAA 

1261 + + + + + + 1320 

CGAGGTGTACGGGTTGTATACATTTTCCTAGAGGGAGGACAGGAAATGCTCTAAACGTTT 

LHMPNICKRISLLSFTRFAN- 

CTTTATGGCCTATTTTGGCCTTAATCTCCATGTCCAGCATCTGGGGAACAATGTTTTCCT 

1321 + + + + + * + 1380 

GAAATACCGGATAAAACCGGAATTAGAGGTACAGGTCGTAGACCCCTTGTTACAAAAGGA 

FMAYFGLNLHVQHLGNNVFL - 

GTTGCAGACTCTCTTTGGTGCAGTCATCCTCCTGGCCAACTGTGTTGCACCTTGGGCACT 

1381 + + + + + + 1440 

CAACGTCTGAGAGAAACCACGTCAGTAGGAGGACCGGTTGACACAACGTGGAACCCGTGA 

LQTLFGAVI LLANCVAPWAL - 

GAAATACATGAACCGTCGAGCAAGCCAGATGCTTCTCATGTTCCTAGTGGCAATCTGCCT 

1441 + + : + + + + I 500 

CTTTATGTACTTGGCAGCTCGTTCGGTCTACGAAGAGTACAAGGATGACCGTTAGACGGA 

KY MNRRAS QMLLMFLLA I CL - 

TCTGGCCATCATATTTGTGCCACAAGAAATGCAGACGCTGCGTGAGGTTTTGGCAACACT 

1501 + + — • + + + 1560 

AGACCGGTAGTATAAACACGGTGTTCTTTACGTCTGCGACGCACTCCAAAACCGTTGTGA 

LAI IFVPQEMQTLREVIiATL - 

GGGCTTAGGAGCGTCTGCTCTTGCCAATACCCTTGCTTTTGCCCATGGAAATGAAGTAAT 

1561 + + ♦ + + 1620 

CCCGAATCCTCGCAGACGAGAACGGTTATGGGAACGAAAACGGGTACCTTTACTTCATTA 

r 

GLGASALANTLAFAHGNEVI - 

TCCCACCATAATCAGGGCAAGAGCTATGGGGATCAATGCAACCTTTGCTAATATAGCAGG 

1621 + + + + + I 680 

AGGGTGGTATTAGTCCCGTTCTCGATACCCCTAGTTACGTTGGAAACGATTATATCGTCC 

PT I IRARAMG INATFANI AG - 
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WO 01/04283 



PCT/US00/18980 



AGCCCTGGCTCCCCTCATGATGATCCTAAGTGTGTATTCTCCACCCCTGCCCTGGATCAT 

1681 + + + + + : + 1740 

TCGGGACCGAGGGGAGTACTACTAGGATTCACACATAAGAGGTGGGGACGGGACCTAGTA 

ALAPLMMILSVYSPPLPWII- 

CTATGGAGTCTTCCCCTTCATCTCTGGCTTTGCTTTCCTCCTCCTTCCTGAAACCAGGAA 

1741 + + + + + + 1800 

GATACCTCAGAAGGGGAAGTAGAGACCGAAACGAAAGGAGGAGGAAGGACTTTGGTCCTT 

YGVFPFISGFAFLLLPETRN- 

CAAGCCTCTGTTTGACACCATCCAGGATGAGAAAAATGAGAGAAAAGACCCCAGAGAACC 

1801 + + + + + + 1860 

GTTCGGAGACAAACTGTGGTAGGTCCTACTCTTTTTACTCTCTTTTCTGGGGTCTCTTGG 

KPLFDTIQDEKNERKDPREP- 

AAAGCAAGAGGATCCGAGAGTGGAAGTGACGCAGTTTTAAGGAATTCCAGGAGCTGACTG 

1861 + + + + + + 1920 

TTTCGTTCTCCTAGGCTCTCACCTTCACTGCGTCAAAATTCCTTAAGGTCCTCGACTGAC 

KQEDPRVEVTQF* 

CCGATCAATGAGCCAGATGAAGGGAACAATCAGGACTATTCCTAGACACTAGCAAAA 

1921 + + + + + 1977 

GGCTAGTTACTCGGTCTACTTCCCTTGTTAGTCCTGATAAGGATCTGTGATCGTTTT 
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WO 01/04283 

CTCCTGATAGCAAAAGAACTGAGGAAGCTCTTTCCACTACGGCTGTATTGCACTGGTGAG 



! 



GAGGACTATCGTTTTCTTGACTCCTTCGAGAAAGGTGATGCCGACATAACGTGACCACTC 



60 



TCCGGGCCCATGGATGAGAAATTGATGCGAGGATCAATACAAGCTTAATTTGAATTAATA 

61 + + + + + + 120 

AGGCCCGGGTACCTACTCTTTAACTACGCTCCTAGTTATGTTCGAATTAAACTTAATTAT 

AAAGGAAATATTTTCTCCCTTTGAACTTATCTCCGTAAAGCCATTGTGCCTCCTCTTGGG 

JL21 + + + + + + 180 

TTTCCTTTATAAAAGAGGGAAACTTGAATAGAGGCATTTCGGTAACACGGAGGAGAACCC 

GGTCACGTGTTCACAATCAATGGCCTTTGAGGAGCTCTTGAGTCAAGTTGGAGGCCTTGG 

L81 + + + + + + 240 

CCAGTGCACAAGTGTTAGTTACCGGAAACTCCTCGAGAACTCAGTTCAACCTCCGGAACC 

MAFEELLSQVGGLG- 

GAGATTTCAGATGCTTCATCTGGTTTTTATTCTTCCCTCTCTCATGTTATTAATCCCTCA 

B 41 + + + + + + 300 

CTCTAAAGTCTACGAAGTAGACCAAAAATAAGAAGGGAGAGAGTACAATAATTAGGGAGT 

RFQMLHLVFILPSLMLLIPH- 

TATACTGCTAGAGAACTTTGCTGCAGCCATTCCTGGTCATCGTTGCTGGGTCCACATGCT 

jOl + + + + + + 360 

ATATGACGATCTCTTGAAACGACGTCGGTAAGGACCAGTAGCAACGACCCAGGTGTACGA 



J61 



L It 



N F A A A 



HRCWVHML- 



121 



181 



S41 



GGACAATAATACTGGATCTGGTAATGAAACTGGAATCCTCAGTGAAGATGCCCTCTTGAG 

+ + + + + + 420 

CCTGTTATTATGACCTAGACCATTACTTTGACCTTAGGAGTCACTTCTACGGGAGAACTC 

DNNTGSGNE. TGILSEDALLR- 

AATCTCTATCCCACTAGACTCAAATCTGAGGCCAGAGAAGTGTCGTCGCTTTGTCCATCC 

+ + : + + + + 480 

TTAGAGATAGGGTGATCTGAGTTTAGACTCCGGTCTCTTCACAGCAGCGAAACAGGTAGG 

ISIPLDSNLRPEKCRRFVHP- 

CCAGTGGCAGCTTCTTCACCTGAATGGGACTATCCACAGCACAAGTGAGGCAGACACAGA 

+ + + + + + 540 

GGTCACCGTCX3AAGAAGTGGACTTACCCTGATAGGTGTCGTGTTCACTCCGTCTGTGTCT 

QWQLLHLNGTIHSTSEADTE- 



ACCCTGTGTGGATGGCTGGGTATATGATCAAAGCTACTTCCCTTCGACCATTGTGACTAA 



+ + -+ + * 

TGGGACACACCTACCGACCC^TATACTAGTTTCGATGAAGGGAAGCTGGTAACACTGATT 



+ 600 



C V 



w 



K - 
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WO 01/04283 



PCT/US00/18980 



i 



i 



i 



GTGGGACCTGGTATGTGATTATCAGTCACTGAAATCAGTGGTTCAATTCCTACTTCTGAC 
iOl + + + + + + 660 

CACCCTGGACCATACACTAATAGTCAGTGACTTTAGTCACCAAGTTAAGGATGAAGACTG 

WniiVCDYQSLKSVVQFLLIiT - 

TGGAATGCTGGTGGGAGGCATCATAGGTGGCCATGTCTCAGACAGGTTTGGGCGAAGATT 
B61 + + + + + + 720 

ACCTTACGACCACCCTCCGTAGTATCCACCGGTACAGAGTCTGTCCAAACCCGCTTCTAA 



G M L V G G 



IGGHVSDRFGRRF - 



TATTCTCAGATGGTGTTTGCTCCAGCTTGCCATTACTGACACCTGCGCTGCCTTCGCTCC 

1721 + + + + + + 780 

ATAAGAGTCTACCACAAACGAGGTCGAACGGTAATGACTGTGGACGCGACGGAAGCGAGG 

ILRWCLLQLAITDTCAAFAP- 

CACCTTCCCTGTTTACTGTGTACTACGCTTCTTGGCAGGTTTTTCTTCCATGATCATTAT 



781 



840 



GTGGAAGGGACAAATGACACATGATGCGAAGAACCGTCCAAAAAGAAGGTACTAGTAATA 
TFPVYCVLRFLAGFSSMIII- 

ATCAAATAATTCTTTGCCCATTACTGAGTGGATAAGGCCCAACTCTAAAGCCCTGGTAGT 

341 + + + + + + 900 

TAGTTTATTAAGAAACGGGTAATGACTCACCTATTCCGGGTTGAGATTTCGGGACCATCA 

S NN S L P I TEWI R PN S K A I* V V - 

AATATTGTCATCTGGTGCCCTTAGTATTGGACAGATAATCCTGGGAGGCTTGGCTTATGT 

J01 + + + + + + 960 

TTATAACAGTAGACCACGGGAATCATAACCTGTCTATTAGGACCCTCCGAACCGAATACA 



GALS 



G Q 



LGGLAYV- 



CTTCCGAGACTGGCAAACCCTGCACGTGGTGGCGTCAGTACCTTTCCTTGGCCTCCTTCT 

|fe61 + + — + + + + 1020 

j GAAGGCTCTGACCGTTTGGGACGTGCACCACCGCAGTCATGGAAAGGAACCGGAGGAAGA 

i 



DWQ TLHVVASV 



F L G L L L - 



CCTTCAAAGGTGGCTGGTGGAATCTGCTCGGTGGTTGATAATCACCAATAAACTAGATGA ' 

ib2X + + + + + + 1080 

GGAAGTTTCCACCGACCACCTTAGACGAGCCACCAACTATTAGTGGTTATTTGATCTACT 

LQRWLVE SARWLI ITNKLDE- 

GGGCTTAAAGGCACTTAGAAAAGTTGCACGCACAAATGGAATAAAGAATGCTGAAGAAAC 

31P81 + + + -+ + + 1140 

CCCGAATTTCCGTGAATCTTTTCAACGTGCGTGTTO 



GLKALRKVARTNGI KNA EET - 
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CCTGAACATAGAGGTTGTAAGATCCACCATGCAGGAGGAGCTGGATGCAGCACAGACCAA 

1141 + + + + + + 1200 

GGACTTGTATCTCCAACATTCTAGGTGGTACGTCCTCCTCGACCTACGTCGTGTCTGGTT 

LNIEVVRSTMQEELDAAQTK - 

AACTACTGTGTGTGACTTGTTCCGCAACCCCAGTATGCGTAAAAGGATCTGTATCCTGGT 

1201 + + h + + + 1260 

TTGATGACACACACTGAACAAGGCGTTGGGGTCATACGCATTTTCCTAGACATAGGACCA 

TTVCDLFRNPSMRKRICILV- 

ATTTTTGAGATTTGCAAACACAATACCTTTTTATGGTACCATGGTCAATCTTCAGCATGT 

1261 + + + + + + 1320 

TAAAAACTCTAAACGTTTGTGTTATGGAAAAATACCATGGTACCAGTTAGAAGTCGTACA 

FLRFANT IPFYGTMVNLQ HV- 

GGGGAGCAACATTTTCCTGTTGCAGGTACTTTATGGAGCTGTCGCTCTCATAGTTCGATG 

1321 + + + + + + 1380 

CCCCTCGTTGTAAAAGGACAACGTCCATGAAATACCTCGACAGCGAGAGTATCAAGCTAC 

GSNIFLLQVLYGAVALIVRC - 

TCTTGCTCTTTTGACACTAAATCATATGGGCCGTCGAATAAGCCAGATATTGTTCATGTT 

1381 + + + : + + + 1440 

AGAACGAGAAAACTGTGATTTAGTATACCCGGCAGCTTATTCGGTCTATAACAAGTACAA 

LALLTLNHMGRRI SQI LFMF - 

CCTGGTGGGCCTTTCC^TTTTGGCCAACACGTTTGTGCCCAAAGAAATGCAGACCCTGCG 

1441 + + + + + + 1500 

GGACCACCCGGAAAGGTAAAACCGGTTGTGCAAACACGGGTTTCTTTACGTCTGGGACGC 

LVGLS I LANTFVPKEMQTLR 

TGTGGCTTTGGCATGTCTGGGAATCGGCTGTTCTGCTGCTACTTTTTCCAGTGTTGCTGT 

1501 + — + + ' + + ' + 1560 

ACACCGAAACCGTACAGACCCTTAGCCGACAAGACGACGATGAAAAAGGTCACAACGACA 

VALACLG IGCSAATFSSVAV- 

TCACTTCATTGAACTCATCCCCACTGTTCTCAGGGCAAGAGCTTCAGGAATAGATTTAAC 

1561 + + + + + + 1620 

AGTGAAGTAACTTGAGTAGGGGTGACAAGAGTCCCGTTCTCGAAGTCCTTATCTAAATTG 

HFIELIPTVLRARASGIDLT- 

GGCTAGTAGGATTGGAGCAGCACTGGCTCCCCTCTTGATGACCTTAACGGTATTTTTTAC 

1621 + + + + + + 1680 

CCGATCATCCTAACCTCGTCGTGACCGAGGGGAGAACTACTGGAATTGCCATAAAAAATG 

AS RIGAALAPLLMTLTVFFT- 
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CACTTTGCCATGGATCATTTATGGAATCTTCCCCATCATTGGTGGCCTTATTGTCTTCCT 
1681 + + + + + + 1740 

GTGAAACGGTACCTAGTAAATACCTTAGAAGGGGTAGTAACCACCGGAATAACAGAAGGA 

TLPWIIYGIFPIIGGLIVFL- 

CCTACCAGAAACCAAGAATCTGCCTTTGCCTGACACCATCAAGGATGTGGAAAATCAAAA 
1741 + + + + + + iQOO 

GGATGGTCTTTGGTTCTTAGACGGAAACGGACTGTGGTAGTTCCTACACCTTTTAGTTTT 

LPETKNLPLPDTIKDVENQK- 

AAAAAATCTCAAGGAAAAGGCATAAAAATGATTGCTACACAAAAGTGACCAAATTTTAAG 

1801 ■ + ■ + + + — + + I860 

TTTTTTAGAGTTCCTTTTCCGTATTTTTACT^ 

KNLKEKA* 

AAGCCTTCATGAGCTGATTGGTGGGGAAATTCAGAAAAAAAAATACAGGAAAAGAACACA 

1861 + + + + + + 1920 

TTCGGAAGTACTCGACTAACCACCCCTTTAAGTCTTTTTTTTTATGTCCTTTTCTTGTGT 

CCAGAAGGGTTTTTTTCCCTACAACCAGCAAGAACATATATTAGATACATGAATCTCAAT 

1921 + + + + + + 1980 

GGTCTTCCCAAAAAAAGGGATGTTGGTCGTTCTTGTATATAATCTATGTACTTAGAGTTA 

TATAATTATGGCATTAATTTGCATTTTATTTCAAAATTAACTTGTGGGGACATGTAATCT 

1981 + + + — + + + 2040 

ATATTAATACCGTAATTAAACGTAAAATAAAGTTTTAATTGAACACCCCTGTACATTAGA 

CTTGAGCAATCTGATATTTTTGGGAAGTCCTTTAAAAAGTTACAAATTTATCAATAAATT 

2041 + + + + + + 2100 

GAACTCGTTAGACTATAAAAACCCTTCAGGAAATTTTTCAATGTTTAAATAGTTATTTAA 

ACTAGTAGATAAGATGATTCAGAAACAAAGGAAAATCACAGAATTAGGATGTGGCTGGCT 

2101 + + + + + + 2160 

TGATCATCTATTCTACTAAGTCTTTGTTTCCTTTTAGTGTCTTAATCCTACACCGACCGA 

TGGTGTATGAAGCACCATGTGATGAATTCATAAAGTTGCAAAAGTCAAAACAATACTGTA 

2161 + + + + + + 2220 

ACCACATACTTCGTGGTACACTACTTAAGTAT1TCAACGTTTTCAGTTTTGTTATGACAT 

CATGCAACCAGAAATCAAATTAAATCCAGAAATAGAGACCTATATAAATGCATTTAATAC 

2221 + + + . + + + 2280 

GTACGTTGGTCTTTAGTTTAATTTAGGTCTTTATCTCTGGATATATTTACGTAAATTATG 

ATGATACTTTTGACATATTAAGCCATTGGAAAACGGAAGGATTAGATACTTAAATAACAT 

2281 + + + + + + 2340 

TACTATGAAAACTGTATAATTCGGTAACCTTTTGCCTTCCTAATCTATGAATTTATTGTA 

TGCTATCTCTTTGTAAATACAGTCACTAAATGATGTTAGTTACTTTTCCATGGTGGAATT 

2341 + + + + + + 2400 

ACGATAGAGAAACATTTATGTCAGTGATTTACTACAATCAATGAAAAGGTACCACCTTAA 
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TT/VATTACTTTTTCTTTGTAATTTTTCTCTCTGTATATTTTAAACAAATAGCTGGTATAG 

401 + + + + + + 2460 

AATTAATGAAAAAGAAACATTAAAAAGAGAGACATATAAAATTTGTTTATCGACCATATC 

TTTACAATATTATAAAGATATTGTTCAAATTGAAGGGCAAAGGCCAGGTTCAGCAATTTT 

461 + + + + + + 2520 

AAATGTTATAATATTTCTATAACAAGTTTAACTTCCCGTTTCCGGTCCAAGTCGTTAAAA 

CAAACTGTATGTACATTTAATAAAATAACTATAAATTAAAAAATTATATTTCAAATGATG 

521 + + + + + + 2580 

GTTTGACATACATGTAAATTATTTTATTGATATTTAATTTTTTAATATAAAGTTTACTAC 

TGACTAATAAATGAAAGTACATATAGTAGTAAAGTAATTTCAGGCAAACCTATATAACCA 

gi + + + + + + 2640 

ACTGATTATTTACTTTCATGTATATCATCATTTCATTAAAGTCCGTTTGGATATATTGGT 



AAATATAAACTTTCATTTTAAACAGCAAAAAAAAAAAAAAAAAA 

641 +. + + + 2684 

TTTATATTTGAAAGT/^AAATTTGTCGTTTTTTTTTTTTTTTTTT 
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Figurej 12: Expression of hOAT3 in Xenopus oocytes. Oocytes were injected with either 
waterit hOAT3 complementary RNA and then assayed for uptake of labeled PAH* 
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Figure 13: Human Multiple-tissue Northern Blots (Clontech) were hybridized with indicated 
hOAT fcDNAs. Larie numbers corresponds to RNAs extracted from following human tissues: 



. brain 
. heart 

. skeletal muscle 
. colon 
f>. thymus 
. spleen 



7. kidney 

8. liver 

9. small intestine 

10. placenta 

11. lung 

12. peripheral blood leukocytes. 
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(57) Abstract: This invention relates to novel human 
organic anion transporter polypeptides and genes encoding 
such polypeptides. The invention is directed towards the 
isolation, characterization and use of human organic anion 
transporter polypeptides for pharmacological screening 
of substrates, analogues of substrates, and modulators of 
human organic anion proteins. This invention also provides 
cell cultures, which stably contain nucleic acid sequences 
encoding human organic anion transport polypeptides. 
Such cell lines are useful for screening compounds in vitro 
to identify and characterize modulators, inhibitors and 
substrates of human organic anion transporter polypeptides. 
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ORGANIC ANION TRANSPORTER GENES AND PROTEINS 



Related Applications 

This application claims the benefit of U.S. Serial No. 60/143,771, entitled 
5 "Organic Anion Transporter Genes and Proteins," which was filed July 12, 1999 and 
which is incorporated by reference herein in its entirety, including figures. 

Field of the Invention 

The present invention relates to specific proteins as well as recombinant versions 
10 of these proteins which are organic anion transporters. These proteins include human 
membrane proteins preferentially found in kidney and liver cells. The present invention 
also relates to nucleotide sequences encoding these novel organic anion transporters. In 
another aspect, the present invention relates to methods for using these proteins as in vitro 
screening agents to identify substrates for the proteins and inhibitors which block transport 
1 5 activity. In yet another aspect, these proteins may be used in in vitro assays to predict 
drug pharmacokinetics, drug distribution and drug toxicity. The invention also discloses 
antibodies that specifically bind to these proteins and which may be used in diagnostic 
assays. 

20 Background of the Invention 

The following describes certain relevant art, none of which is admitted to be prior 
art to the inventions described herein. 

The liver and kidney are two organs that can extract a variety of organic anions 
from circulation, including endogenous compounds such as bile acids and bilirubin and 

25 xenobiotics such as sulfobromophthalein (BSP) and /?-aminohippurate (PAH). The liver 
and kidney thus have the critical functions of bile secretion, detoxification, and drug 
elimination. Since the cell membrane represents a hydrophobic barrier that prevents influx 
of charged or hydrophilic molecules, hepatocytes and kidney epithelial cells express 
proteins on their basolateral membranes to facilitate transport of organic anions into the 

30 cell. The liver and kidney contain membrane proteins that transport very specific 

compounds such as taurocholic acid or prostaglandins. However, they also have proteins 
that exhibit a wider substrate specificity. Recently, transporters of the latter type have 
been cloned from rat liver and kidney. One such protein is called the organic anion 
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transporting polypeptide, oatp (Jacquemin et al. (1994) PNAS 91 : 133-37). This protein 
belongs to a family of related transporters, members of which consists of oatp 1, oatp2, 
oatp3, and prostaglandin transporters. These proteins share homology at the amino acid 
level (Noe et al. (1997) PNAS 94: 10346-50; Kanai et al. (1995) Science 268: 866-69; Abe 
5 et al. (1998) J. Biol Chem. 273: 22395-401). When expressed mXenopus oocytes or in 
mammalian cells, members of the oatp family were shown to transport many types of 
organic anions including taurocholic acid, BSP, and conjugated steroid hormones. The 
related prostaglandin transporters show high affinity transport of prostaglandins. 

Shortly after the cloning of oatp 1, members of another organic anion transporter 

10 family, OAT1 and OAT2, were isolated from rat kidney and liver, respectively (Sekine et 
al. (1997) J. Biol Chem 272: 18526-29; Sekine et al. (1998) FEBS Letters 429: 179-82). 
Rat OAT1 and OAT2 are homologous to each other but show no significant homology to 
members of the oatp family. However, they show some homology to the family of organic 
cation transporters (OCTs), suggesting that these two families of proteins share a common 

15 origin. Rat OAT 1 and OAT2 can transport many different organic anions. OATl,when 
expressed in oocytes, can transport /?-aminohippurate (PAH), methotrexate, and glutarate. 
OAT1 is most likely the molecular entity responsible for the classical jp-aminohippurate/a- 
ketoglutarate exchanger found in the kidney proximal tubule. OAT2, expressed 
predominantly in the liver, has been shown to transport organic anions such as salicylate, 

20 methotrexate, and a-ketoglutarate. Therefore, OAT1 and OAT2 belong to yet another 
family of multi-specific organic anion transporters distinct from the oatp family. 



Summary of the Invention 

The present invention is directed to a group of polypeptides, preferentially 
25 expressed in the liver and kidneys of humans and which have activity as organic anion 
transporters. We refer to these polypeptides as "human organic anion transporters" or 
"hOAT polypeptides". These proteins and their properties are described in detail below. 

The invention also concerns, nucleic acids encoding hOAT polypeptides, cells, 
tissues and animals containing such nucleic acids, antibodies to the polypeptides, assays 
30 utilizing the polypeptides, and methods relating to all of the foregoing. 

A first aspect of the invention features an isolated, enriched, or purified nucleic 
acid molecule encoding an hOAT polypeptide or encoding a fragment of an hOAT 
polypeptide. 
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In preferred embodiments the isolated nucleic acid comprises a nucleic acid 
sequence set forth in SEQ ED NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4. SEQ ID 
NO:5, SEQ ID NO:6, or a functional derivative thereof, a nucleic acid sequence that 
hybridizes to the nucleic acid sequence set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
5 NO:3, SEQ ID NO:4 % SEQ ID NO:5 t SEQ ID NO:6, or a functional derivative thereof. 

The nucleic acid may be isolated from a natural source by cDNA cloning or 
subtractive hybridization; the natural source may be mammalian (human) blood, semen, or 
tissue and the nucleic acid may be synthesized by the triester or other method or by using 
an automated DNA synthesizer. Preferably, the nucleic acid is isolated from mammalian 
1 0 kidney or brain, most preferably from liver. 

Individual clones isolated from a cDNA library may be purified to electrophoretic 
homogeneity. The claimed DNA molecules obtained from these clones can be obtained 
directly from total DNA or from total RNA. The cDNA clones are not naturally 
occurring, but rather are preferably obtained via manipulation of a partially purified 
1 5 naturally occurring substance (messenger RNA). The construction of a cDNA library 
from mRNA involves the creation of a synthetic substance (cDNA) and pure individual 
cDNA clones can be isolated from the synthetic library by clonal selection of the cells 
carrying the cDNA library. Thus, the process which includes the construction of a cDNA 
library from mRNA and isolation of distinct cDNA clones yields an approximately 10 6 - 
20 fold purification of the native message. Thus, purification of at least one order of 

magnitude, preferably two or three orders, and more preferably four or five orders of 
magnitude is expressly contemplated. 

In another preferred embodiment, the nucleic acid molecules of the invention 
comprise nucleotide sequences that (a) have the nucleic acid sequences set forth in SEQ 
25 ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a 
functional derivative thereof, or encode polypeptides having the full-length amino acid 
sequences set forth in SEQ ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO: 10, SEQ 
ID NO:l 1. SEQ ID NO: 12, or a functional derivative thereof; (b) is the complement of the 
nucleotide sequence of (a); (c) hybridizes under highly stringent conditions to the 
30 nucleotide molecules of (a) and encodes a naturally occurring hOAT polypeptide; (d) 

encodes an hOAT polypeptide having the full-length amino acid sequence of the sequence 
set forth in SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:ll, 
or SEQ ED NO:l 2 except that it lacks one or more of the domains selected from the group 
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consisting of an extracellular domain, a transmembrane domain, and an intracellular 
domain; or (e) is the complement of the nucleotide sequence of (d). 

The nucleic acid molecules of the invention are isolated, enriched, or purified 
preferably from a mammal, more preferably from a human. 

5 Hybridization techniques may be used to isolate a nucleic acid molecule of interest. 

Various low or high stringency hybridization conditions may be used depending upon the 
specificity and selectivity desired. Stringency is controlled by varying salt or denaturant 
concentrations. Highly stringent conditions may mean conditions that are at least as 
stringent as the following: hybridization in 50% formamide, 5x SSC, 50 mM NaH 3 P0 4 , 

10 pH 6.8, 0.5% SDS, 0. 1 mg/mL sonicated salmon sperm DNA, and 5x Denhart solution at 
42 °C overnight; washing with 2x SSC. 0.1% SDS at 45 °C; and washing with 0.2x SSC, 
0.1% SDS at 45 °C. Those skilled in the art will recognize how such conditions can be 
varied to vary specificity and selectivity. 

Particularly preferred embodiments of this aspect of the invention are naturally or 

15 non-naturally occurring variants of the nucleic acids of the invention. Among variants in 
this regard are variants that differ from the aforementioned nucleic acid molecules by 
nucleotide substitutions, deletions or additions. The substitutions, deletions or additions 
may involve one or more nucleotides. The variants may be altered in coding or non- 
coding regions or both. Alterations in the coding regions may produce conservative or 

20 non-conservative amino acid substitutions, deletions or additions. Such nucleic acid 

molecules are identifiable as being able to hybridize to or which are at least about 60-65% 
percent identical, preferably at least about 70-75% percent identical, more preferably at 
least about 80-83% percent identical, and even more preferably at least about 87-95% 
percent identical to the nucleic acid molecules shown in SEQ ID NO:l, SEQ ID NO:2, 

25 SEQ ID NO:3. SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. 

Among the particularly preferred embodiments of the invention in this regard are 
nucleic acid molecules encoding polypeptides having the amino acid sequences set forth in 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO:12; variants, analogs, derivatives and fragments thereof, and fragments of the variants, 

30 analogs and derivatives. 

Further particularly preferred in this regard are nucleic acid molecules encoding 
OAT variants, analogs, derivatives and fragments, and variants, analogs and derivatives of 
the fragments, which have the amino acid sequence for the hOAT polypeptides of SEQ ID 
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NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, SEQ ID NO: 12 in 
which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are substituted, 
deleted or added, in any combination. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and activities of the 

5 OAT polypeptides. Also especially preferred in this regard are conservative substitutions. 
In one embodiment, the polypeptide encoded by the nucleic acid molecule is at least about 
30-35%, preferably at least about 40-45%, more preferably at least about 50-55%, even 
more preferably at least about 60-65%, yet more preferably at least about 70-75%, still 
more preferably at least about 80-85%, and most preferably at least about 90-95% or more 

10 identical to the amino acid sequence of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO: 10, SEQ ID NO:l 1, or SEQ ID NO:12. 

In still another embodiment, the nucleic acid molecule encodes a naturally 
occurring variant of the polypeptide of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ 
ID NO: 1 0, SEQ ED NO: 1 1 , or SEQ ID NO: 12 and hybridizes under stringent conditions to 

15 a nucleic acid molecule comprising SEQ ID NO:l, SEQ ED NO:2, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:5 or SEQ ID NO:6, respectively. 

Further preferred embodiments are variaSFnts of the herein described nucleic acid 
molecules in which the transporting activity of the polypeptides is altered. Such 
alterations can result in either increased, or decreased transport of substrates recognized by 

20 hOAT polypeptides. Further, variants can be made that result in OAT polypeptides which 
are no longer responsive to cellular signals, for example, protein kinase C deactivation of 
transport. 

Especially preferred are nucleic acid molecules encoding OATs which are 
expressed on the basolateral (sinusoid) membrane of hepatocyte cells. 

25 Included in the invention are all possible nucleic acid sequences that encode the 

amino acid sequences of the hOAT polypeptides of the invention. As it is recognized that 
alternate codons will encode the same amino acid for most amino acids due to the 
degeneracy of the genetic code, the sequences of this aspect includes nucleic acid 
sequences utilizing such alternate codon usage for one or more codons of a coding 

30 sequence. For example, all four nucleic acid sequences OCT, GCC, GCA, and GCG 

encode the amino acid, alanine. Therefore, if for an amino acid there exists an average of 
three codons, a polypeptide of 100 amino acids in length will, on average, be encoded by 
3 100 , or 5 x 10 47 , nucleic acid sequences. Thus, a nucleic acid sequence can be modified 
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(e.g., a nucleic acid sequence from an hOAT as specified above) to form a second nucleic 
acid sequence encoding the same polypeptide as encoded by the first nucleic acid 
sequence using routine procedures and without undue experimentation. Thus, all possible 
nucleic acid sequences that encode the amino acid sequences of the hOAT polypeptides of 

5 the invention are included in the present invention and are described, as if all were written 
out in full, taking into account the codon usage. 

The alternate codon descriptions are available in common textbooks, for example, 
Stryer, BIOCHEMISTRY 3 rd ed., and Lehninger, BIOCHEMISTRY 3 rd ed. Codon 
preference tables for various types of organisms are available in the literature. Because of 

10 the number of sequence variations involving alternate codon usage, for the sake of brevity, 
individual sequences are not separately listed herein. Instead the alternate sequences are 
described by reference to the natural sequence with replacement of one or more (up to all) 
of the degenerate codons with alternate codons from the alternate codon table, preferably 
with selection according to preferred codon usage for the normal host organism or a host 

1 5 organism in which a sequence is intended to be expressed. Those skilled in the art also 

understand how to alter the alternate codons to be used for expression in organisms where 
certain codons code differently than shown in the lt universaT codon table. 

In a second aspect, the invention relates to a nucleic acid vector comprising a 
nucleic acid molecule encoding an hOAT polypeptide and a promoter element effective to 

20 initiate transcription in a host cell. 

Those skilled in the art would recognize that a nucleic acid vector can contain 
many other nucleic acid elements besides the promoter element and the hOAT nucleic acid 
molecule. These other nucleic acid elements include, but are not limited to, origins of 
replication, ribosomal binding sites, nucleic acid sequences encoding drug resistance 

25 enzymes or amino acid metabolic enzymes, and nucleic acid sequences encoding secretion 
signals, periplasm or peroxisome localization signals, or signals useful for polypeptide 
purification. 

A circular double stranded nucleic acid molecule can be cut and thereby linearized 
upon treatment with restriction enzymes. An assortment of vectors, restriction enzymes, 
30 and the knowledge of the nucleotide sequences that the restriction enzymes operate upon 
are readily available to those skilled in the art A nucleic acid molecule of the invention 
can be inserted into a vector by cutting the vector with restriction enzymes and ligating the 
two pieces together. 

6 
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Many techniques are available to those skilled in the art to facilitate transformation 
or transfection of the expression construct into a prokaryotic or eukaryotic organism. 
These methods involve a variety of techniques, such as treating the cells with high 
concentrations of salt, an electric field, or detergent, to render the host cell outer 

5 . membrane or wall permeable to nucleic acid molecules of interest. 

In a third aspect, the invention features a nucleic acid probe for the detection of a 
nucleic acid encoding an hOAT polypeptide, fragment or analogue in a sample. The 
nucleic acid probe contains nucleotides that will hybridize specifically to a sequence of at 
least 14 contiguous nucleotides set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 

10 SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or a functional derivative thereof The 
probe is preferably at least 14 or more bases in length and selected to hybridize 
specifically to a unique region of an hOAT encoding nucleic acid. 

In preferred embodiments, the nucleic acid probe hybridizes to at least 14 
nucleotides of a nucleic acid encoding the full-length sequence set forth in SEQ ID NO:7, 

15 SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:ll, SEQ ID NO:12, or a 
functional derivative thereof. Various low or high stringency hybridization conditions 
may be used depending upon the specificity and selectivity desired. Under highly 
stringent hybridization conditions, only highly complementary nucleic acid sequences 
hybridize. Preferably, such conditions prevent hybridization of nucleic acids having 1 or 2 

20 mismatches out of 20 contiguous nucleotides. 

In another preferred embodiment, the nucleic acid is an isolated conserved or 
unique region, for example, those useful for the design of hybridization probes to facilitate 
identification and cloning of additional polypeptides, or for the design of PCR probes to 
facilitate amplification or cloning of additional polypeptides. Such a nucleic acid may. 

25 contain additional sequences in addition to the conserved or unique region, preferably 10- 
20 additional nucleotides on each side of the conserved or unique region, more preferably 
30-40 additional nucleotides, and most preferably 75-100 additional nucleotides. 

Methods for using the probes include detecting the presence or amount of hOAT 
polypeptide RNA in a sample by contacting the sample with a nucleic acid probe under 

30 conditions such that hybridization occurs and detecting the presence or amount of the 
probe bound to hOAT polypeptide RNA. The nucleic acid duplex formed between the 
probe and a nucleic acid sequence coding for an hOAT polypeptide may be used in the 
identification of the sequence of the nucleic acid detected (for example see, Nelson et al., 

7 
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in Nonisotopic DNA Probe Techniques, p. 275 Academic Press, San Diego (Kricka, ed., 
1992) hereby incorporated by reference herein in its entirety, including any drawings). 
Kits for performing such methods may be constructed to include a container having 
disposed therein a nucleic acid probe. 
5 Another feature of the invention is a nucleic acid molecule as set forth in SEQ ID 

NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or 
fragments thereof, comprising one or more regions that encode an hOAT polypeptide or 
an hOAT domain polypeptide, where the hOAT polypeptide or the hOAT domain 
polypeptide is fused to a non-hOAT polypeptide or amino-terminal tag. Possible non- 
10 hOAT polypeptide or amino-terminal tag fusion partners include, for example, but are not 
limited to, glutathione-S-transferase (GST)-fusion proteins, green fluorescent protein 
(GFP) and fusions with histidine residues as described by Janknecht et al., 1991, Proc. 
Natl ScL USA 88:8972-8976. 

The invention also features recombinant nucleic acid, preferably in a cell or an 
15 organism. The recombinant nucleic acid may contain a sequence set forth in SEQ ID 

NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6 or a 
functional derivative thereof and a vector or a promoter effective to initiate transcription in 
a host cell. The recombinant nucleic acid can alternatively contain a transcriptional 
initiation region functional in a cell, a sequence complimentary to an RNA sequence 
20 encoding an hOAT polypeptide and a transcriptional termination region functional in a 
cell. 

Another aspect of the invention relates to a recombinant cell or tissue comprising a 
nucleic acid molecule encoding an hOAT polypeptide or hOAT polypeptide fragment. 
The recombinant cell may comprise a nucleic acid molecule encoding an hOAT 
25 polypeptide; an hOAT domain polypeptide; an hOAT polypeptide fragment or an hOAT 
polypeptide or polypeptide fragment fused to a non-hOAT polypeptide. The recombinant 
cell can harbor a nucleic acid vector that is extragenomic or intragenomic. 

Extragenomic vectors are designed with their own origins of replication allowing 
them to utilize the recombinant cell replication machinery to copy and propagate the 
30 vector nucleic acid sequence. These vectors are small enough that they are not likely to 
harbor nucleic acid sequences homologous to genomic sequences of the recombinant cell. 

Multiple intragenomic vectors are available to those skilled in the art and contain 
nucleic acid sequences that are homologous to nucleic acid sequences in a particular 

8 
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organism's genomic DNA. These homologous sequences will result in recombination 
events that integrate portions of the vector into the genomic DNA. Those skilled in the art 
can control which nucleic acid sequences of the vector are integrated into the cell genome 
by flanking the portion to be incorporated into the genome with homologous sequences in 
5 the vector. 

A new combination of genes or nucleic acid molecules can be introduced into an 
organism using a wide array of nucleic acid manipulation techniques available to those 
skilled in the art. 

In yet another aspect, the invention features an isolated, enriched, or purified 

10 hOAT polypeptide. 

It is also advantageous for some purposes that an amino acid sequence be in 
purified form. Purification of at least one order of magnitude, preferably two or three 
orders, and more preferably four or five orders of magnitude is expressly contemplated. 
The substance is preferably free of contamination at a functionally significant level, for 

15 example 90%, 95%, or 99% pure. The hOAT polypeptides, or fragments thereof, of the 
present invention are preferably isolated, purified, or enriched from a mammal or a 
mammalian cell. The mammal is as defined herein and preferably is a mouse, and most 
preferably is a human. These polypeptides may be isolated, purified, or enriched from a 
cell that comprises an endogenous nucleic acid molecule that encodes the polypeptide, or 

20 from a cell that is transformed with a nucleic acid molecule that encodes the polypeptide. 
The polypeptide may also be chemically synthesized. Procedures for obtaining 
polypeptides using the above methods are well known to those skilled in the art. 

The polypeptides of the invention comprise an amino acid sequence having (a) the 
full-length amino acid sequence encoded by the nucleic acid sequences set forth in SEQ 

25 ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or 
functional derivatives thereof, or the amino acid sequences set forth in SEQ ID NO:7, 
SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO: 10, SEQ ED NO:l 1 , SEQ ID NO: 12, or 

functional derivatives thereof. 

The invention also features an isolated, enriched, or purified hOAT polypeptide 

30 fragment. 

Examples of polypeptide fragments include hOAT domains, and hOAT-specific 
epitopes. Such fragments may, for example, be produced by proteolytic cleavage of the 
full-length protein. Preferably, the fragment is obtained recombinantly by appropriately 

9 
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modifying the DN A sequence encoding the proteins to delete one or more amino acids at 
one or more sites of the C-termmus, N-terminus, and/or within the native sequence. 
Fragments of a protein can be used as antigens for the production of hOAT-specific 
antibodies, or used as competitors of substrates forhOAT polypeptides. It is understood 
5 that such fragments may retain one or more characterizing portions of the native complex. 
Examples of such retained characteristics include: transporting activity; substrate 
specificity; interaction with other molecules in the intact cell; regulatory functions; or 
binding with an antibody specific for the native complex, or an epitope thereof 
Well-known examples of domains are the SH2 (Src Homology 2) domain 

10 (Sadowski, et al., (1986) Mol Cell BioL 6:4396; Pawson and Schlessinger, (1993) Curr. 
Biol. 3:434), the SH3 domain (Mayer, et al., (1988) Nature 332:272; Pawson and 
Schlessinger, (1993) Curr. BioL 3:434), and pleckstrin (PH) domain (Ponting, (1996) TIBS 
21:245; Haslam, et al., (1993) Nature 363:309), all of which are domains that mediate 
protein :protein interaction, and the kinase catalytic domain (Hanks and Hunter, (1995) 

1 5 FASEB J 9:576-595). The relative homology is at least 20%, more preferably at least 30% 
and most preferably at least 35%. Computer programs designed to detect such homologies 
are well known in the art. 

Comparisons between the sequences of two or more polynucleotides or 
polypeptides can be performed using the local homology algorithm of Smith and 

20 Waterman, (198 1) Adv. Appl Math. 2:482, by the homology alignment algorithm of 
Needleman and Wunsch, (1970) J. Mol. BioL 48:443, or the method of Pearson and 
Lipman, (1988) PNAS 85:2444. Computer programs implementing these methods can be 
used and include, BLAST, GAP, BESTFIT, FASTA, and TFASTA which are offered in 
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 

25 Madison, Wis. 

Protein motifs can be identified using computer programs readily available to those 
skilled in the art. One such computer program is offered by the Institute for Chemical 
Research at Kyoto University. The Kyoto computer program is offered at the following 
web site (http://www.motif.eenome.ad.ip/) . 
30 Protein topology, such as the orientation and location of transmembrane helixes 

can also be identified using readily available computer programs. One such program is 
TopPred2 offered by Stockholm University and offered at the following web site 
(http://www.biokemi.su.se/-server/toppred2/). 

10 
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Another aspect of the invention features an isolated, enriched or purified hOAT 

polypeptide analog. 

The hOAT polypeptide analogs of the present invention preferably have a 
substantially similar biological activity to the proteins encoded by the full-length nucleic 
5 acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
SEQ ED NO:5, or SEQ ID NO:6, or the amino acid sequence set forth in SEQ ID NO:7, or 
SEQ ED NO:8; SEQ ID NO:9, SEQ ID NO: 10, SEQ ED NO:l 1, or SEQ ID NO: 12. 

A sequence that is substantially similar will preferably have at least 85% identity 
(more preferably at least 90% and most preferably 95-100%) to the amino acid sequence 
1 0 encoded by the nucleic acid sequence set forth in SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or the amino acid sequence set 
forth in SEQ ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ED NO: 10, SEQ ID NO: 1 1, or 
SEQ ID NO: 12. 

An hOAT polypeptide analog will retain some useful function such as, for 

1 5 example, substrate binding, transport activity, or the ability to bind to an hOAT specific 
antibody (as defined below). The hOAT polypeptide analog may be derived from a 
naturally occurring complex component by appropriately modifying the protein DNA 
coding sequence to add, remove, and/or to modify codons for one or more amino acids at 
one or more sites of the C-terminus, N-terminus, and/or within the native sequence. It is 

20 understood that such analogs having added, substituted and/or additional amino acids 
retain one or more characterizing portions of the native hOAT polypeptides. 

Examples of hOAT polypeptides analogs include, but are not limited to, (1) 
chimeric proteins which comprise a portion of an hOAT polypeptide sequence fused to a 
non-hOAT polypeptide sequence, for example, a polypeptide sequence of 

25 glutathione-S-transferase, (2) hOAT polypeptides lacking a specific domain, for example, 
the protein kinase C domain, (3) epitope-tagged hOAT polypeptides or fragments for 
immuno detection or purification and (4) hOAT proteins having a point mutation. 

Human OAT polypeptide analogs with deleted, inserted and/or substituted amino 
acid residues may be prepared using standard techniques well known to those of ordinary 

30 skill in the art. For example, the modified components of the analogs may be produced 

using site-directed mutagenesis techniques (as exemplified by Adelman et al., (1983) DNA 
2:183 ) wherein nucleotides in the DNA coding the sequence are modified such that a 
modified coding sequence is produced, and thereafter expressing this recombinant DNA in 
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a prokaryotic or eukaryotic host cell, using techniques such as those described above. 
Alternatively, proteins with amino acid deletions, insertions and/or substitutions maybe 
conveniently prepared by direct chemical synthesis, using methods well known in the art. 
The analogs of the hOAT polypeptides may exhibit the same qualitative biological activity 
5 as the hOAT polypeptides themselves. 

In another aspect, the invention features an antibody (e.g., a monoclonal or 
polyclonal antibody) having specific-binding affinity to an hOAT polypeptide or hOAT 

polypeptide analog or fragment. 

Antibodies having specific-binding affinity to an hOAT polypeptide may be used 

1 0 in methods for detecting the presence and/or amount of an hOAT polypeptide in a sample 
by contacting the sample with the antibody under conditions such that an immunocomplex 
forms and detecting the presence and/or amount of the antibody conjugated to the hOAT 
polypeptide. Diagnostic kits for performing such methods may be constructed to include a 
first container containing the antibody and a second container having a conjugate of a 

15 binding partner of the antibody and a label, such as, for example, a radioisotope. The 
diagnostic kit may also include notification of an FDA approved use and instructions 
therefor. 

For the production of polyclonal antibodies, various host animals maybe 
immunized by injection with the antigen. Various adjuvants may be used to increase the 
20 immunological response, depending on the host species. 

Monoclonal antibodies may be obtained by any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. Monoclonal 
antibodies may be obtained by methods known to those skilled in the art. See, for 
example, Kohler, et al., (1975) Nature 256:495-497, and U.S. Patent No. 4,376,1 1 0 issued 
25 March 8, 1983, to David et al. 

Still further, the invention features a hybridoma which produces an antibody 
having specific-binding affinity to an hOAT polypeptide. 

In preferred embodiments, the hOAT antibody comprises a sequence of amino 
acids that is able to specifically bind an hOAT polypeptide. 
30 The invention features a method for identifying human cells containing an hOAT 

polypeptide or a related sequence. The method involves identifying the novel polypeptide 
in human cells using techniques that are routine and standard in the art, such as those 

12 
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described herein for identifying hOAT (e.g., cloning. Southern or Northern blot analysis. 
Western blot analysis, immunoassay, in situ hybridization, PCR amplification, etc.). 

The invention also features methods of screening cells for natural-binding partners 
of hOAT polypeptides. Binding partners include modulators and downstream signaling 
5 molecules such as adaptor proteins and may be identified by techniques well known in the 
art such as co-immunoprecipitation or by using, for example, a two-hybrid screen. (Fields 
and Song, U.S. Patent No. 5,283,173, issued February 1, 1994, and incorporated by 
reference herein). The present invention also features the purified, isolated or enriched 
versions of the polypeptides identified by the methods described above. 
1 0 The invention provides methods for screening compounds for their ability to 

inhibit, or modulate the biological activity of the human organic anion transporter 
molecules of the invention. In preferred embodiments, cells expressing an hOAT 
polypeptide, including recombinant expression constructs of the invention, are contacted 
with a compound, and transport activity is assayed. In especially preferred embodiments, 
1 5 competition assays, using known human organic anion transporter substrates, such as p- 
aminohippurate, are used to test the ability of a screened compound to interfere (compete) 
with the uptake of a known substrate of an hOAT polypeptide. Additional preferred 
embodiments comprise quantitative analyses of such effects. 

The present invention is also useful for the detection of substrates, analogues of 
20 substrates, inhibitors and modulators, heretofore known or unknown, for the transporters 
of the invention, either naturally occurring or embodied as a drug. In preferred 
embodiments, such substrates, analogues, or modulators may be detected in blood, saliva, 
semen, cerebrospinal fluid, plasma, lymph, vitreous humor, or any other bodily fluid. In 
additional preferred embodiments, the invention provides methods for detecting and 
25 identifying substrates, analogues of substrates, or modulators that preferentially affect 
either the uptake function or the efflux function of the transporters of the invention. 

One method for identifying a substance capable of modulating an hOAT 
polypeptide activity comprises the steps of (a) contacting an hOAT polypeptide with a test 
substance; and Qo) determining whether the substance alters the transporting activity of 

30 said polypeptide. 

Another method of identifying substances capable of modulating the function of an 
hOAT polypeptide comprises the following steps: (a) expressing an hOAT polypeptide in 
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cells; (M adding a compound to the cells; and (c) monitoring the transporting activity of 
the hOAT. 

A method of identifying substrates or inhibitors of hOAT polypeptides comprises 
the following steps: (a) adding a test substrate to a cell line expressing an hOAT 
5 polypeptide; and (b) measuring the intracellular concentration of the test substrate. 

A second method of identifying substrates or inhibitors of hOAT polypeptides 
comprises the following steps: (a) adding a test substrate to a cell line expressing an hOAT 
polypeptide; (b) adding a substrate known to be transported by said hOAT polypeptides; 
(c) measuring whether said test substrate competes with the uptake of the known substrate. 

1 0 Another method of identifying substrates of an hOAT polypeptide comprises the 

following steps: (a) attaching a fluorescent compound onto a test compound; (b) adding 
said compound to a cell line expressing an hOAT polypeptide; and (c) monitoring whether 
said fluorescent compound is taken into said cell line through the use of fluorescent 
microscopy or any other method which can detect fluorescence. 

15 In a preferred embodiment, high- throughput screening employing 96- well plates 

and a microtiter® fluorescence detection system is used to detect potential modulators, 
substrates, analogues of substrates and inhibitors of hOAT polypeptides which can 
compete with binding or transport of fluorescent compounds known to bind or be 
transported by hOAT polypeptides. 

20 In yet another embodiment, uptake of a test substrate by an hOAT expressing cell 

line is measured using knowledge of an hOAT's antiporter activity. Uptake is measured 
by monitoring efflux of an accompanying antiporter molecule. Efflux of the antiporter 
molecule can be detected using techniques known to those skilled in the art, for example, 
radiolabeling of the effluxed molecule and measurement using a scintillation counter. 

25 Measurement of antiporter efflux is also amenable to high-throughput screening. 

A still further aspect of the invention is the identification of substrates, analogues 
of substrates, inhibitors, or modulators that affect the pharmacokinetics of drugs and 
compounds transported by hOAT polypeptides. Substances which decrease the 
transporting activity of hOAT polypeptides are useful for increasing the half-life in the 

30 body of drugs and compounds excreted via human organic anion transporters. 

Vet, another aspect of the invention involves substrates, and modulators, which 
increase the transporting activity of an hOAT. Such substances are advantageous for 
increasing the delivery of a drug or compound, which is transported by an hOAT, to a 
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target organ such as kidney or liver. For example, a modulator which increases the 
transporting activity of hOATl would increase the delivery of PAH or other drugs or 
compounds transported by hOATl to the kidney. 

A last aspect of the invention features a cultured cell line which stably expresses an 
5 hOAT polypeptide. Such cell lines are useful for identifying substrates which are 

transported by hOAT polypeptides, inhibitors of hOAT polypeptides and for identifying 
modulators of hOAT polypeptides. Further, a panel of cells stably expressing hOAT 
polypeptides will be useful in studying the pharmacokinetics of various drugs and 
compounds. Preferably, the expression of an hOAT polypeptide in such cell lines is under 
1 0 the control of an inducible promoter. More preferably, said inducible promoter has a very 
low basal level of expression in such cell lines and high expression when induced. 

A cell line stably expressing an hOAT polypeptide is useful for designing 
substrates which can be transported by an hOAT polypeptide at either an increased or a 
decreased rate. Decreasing the rate of transport can increase the effectiveness of some 
15 medicinals by making their clearance from the body occur more slowly. 

Cell lines stably expressing hOAT polypeptides are also useful for the 
identification and development of drugs or compounds targeted to a particular organ. For 
example, drugs or compounds identified as substrates of hOATl would be good 
candidates for treating diseases of the kidneys whereas those identified as substrates of 
20 hOAT2 would be good candidates for treating diseases of the liver. 

The OAT expressing cell lines are also useful for identifying potentially toxic 
compounds or drugs. For example, newly discovered or modified drugs belonging to a 
class of drugs known to be toxic to certain organs, e.g., kidney can be tested for transport 
against the panel of cells expressing human OAT1 or hOAT3 polypeptides. If the new 
25 drug or compound is transported by hOATl expressing cells, the new drug or compound 
may also be toxic to the kidney. However, if the new drug or compound, belonging to a 
class known to be toxic, is not transported by hOATl or hOAT3 expressing cells than the 
new drug or compound would be expected to be less toxic to the kidney than the parent 
class of drugs or compounds. 
30 A preferred embodiment includes the identification of toxic compounds which are 

substrates of hOATs, and which can be used to treat diseases of organs or tissues that 
express hOATs. For example, toxic compounds found to be transported by hOATl or 
hOAT3 would have potential use as anti-cancer drugs to treat cancers of the kidney. 

15 



01O4283A3 _IA> 



WO 01/04283 PCT/US00/18980 

The methods of the present invention can utilize any of the molecules disclosed in 
the invention. These molecules include nucleic acid molecules encoding hOAT 
polypeptides, nucleic acid vectors, recombinant cells, polypeptides, or antibodies 
described herein. 

5 The summary of the invention described above is non-limiting and other features 

and advantages of the invention will be apparent from the following detailed description, 
and from the claims. 

Definitions 

10 By "isolated" in reference to nucleic acid it is meant a polymer preferably 

consisting of 14 or more nucleotides conjugated to each other, including DNA or RNA 
that is isolated from a natural source or that is synthesized. The isolated nucleic acid of 
the present invention is unique in the sense that it is not found in a pure or separated state 
in nature. Use of the term "isolated" indicates that a naturally occurring sequence has 

15 been removed from its normal cellular (i.e., chromosomal) environment. Thus, the 

sequence may be in a cell-free solution or placed in a different cellular environment. The 
term does not imply that the sequence is the only nucleotide sequence present, but that it is 
essentially free (about 90-95% pure at least) of nucleotide material naturally associated 
with it and thus is meant to be distinguished from isolated chromosomes. 

20 By the use of the term "enriched" in reference to nucleic acid it is meant that the 

specific DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of the 
total DNA or RNA present in the cells or solution of interest than in normal or diseased 
cells or in the ceils from which the sequence was taken. This could be caused by a 
preferential reduction in the amount of other DNA or RNA present, or by a preferential 

25 increase in the amount of the specific DNA or RNA sequence, or by a combination of the 
two. However, it should be noted that "enriched** does not imply that there are no other 
DNA or RNA sequences present, just that the relative amount of the sequence of interest 
has been significantly increased. 

The term "significant" here is used to indicate that the level of increase is useful to 

30 the person making such an increase, and generally means an increase relative to other 
nucleic acids of about at least 2 fold, more preferably at least 5 to 10 fold or even more. 
The term also does not imply that there is no DNA or RNA from other sources. The other 
source DNA may, for example, comprise DNA from a yeast or bacterial genome, or a 
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cloning vector such as pUC19. This term distinguishes the sequence from naturally 
occurring enrichment events, such as viral infection, or tumor type growths, in which the 
level of one mRNA may be naturally increased relative to other species of mRNA. That 
is, the term is meant to cover only those situations in which a person has intervened to 
5 elevate the proportion of the desired nucleic acid. 

The term "purified" in reference to nucleic acid does not require absolute purity 
(such as a homogeneous preparation); instead, it represents an indication that the sequence 
is relatively purer than in the natural environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of mg/mL). The term is also chosen to 

10 distinguish clones already in existence which may encode an hOAT polypeptide but which 
have not been isolated from other clones in a library of clones. Thus, the term covers 
clones encoding an hOAT polypeptide which are isolated from other non-hOAT clones. 

The term "nucleic acid molecule" describes a polymer of deoxyribonucleotides 
(DNA) or ribonucleotides (RNA). The nucleic acid molecule may be isolated from a 

15 natural source by cDNA cloning or subtractive hybridization or synthesized manually. 
The nucleic acid molecule may be synthesized manually by the triester synthetic method 
or by using an automated DNA synthesizer. 

The term "cDNA cloning" refers to hybridizing a small nucleic acid molecule, a 
probe, to cDNA. The probe hybridizes (binds) to complementary sequences of cDNA. 

20 The term "complementary" describes two nucleotides that can form multiple 

favorable interactions with one another. For example, adenine is complementary to 
thymine as they can form two hydrogen bonds. Similarly, guanine and cytosine are 
complementary since they can form three hydrogen bonds. Thus if a nucleic acid 
sequence contains the following sequence of bases, thymine, adenine, guanine and 

25 cytosine, a "complement" of this nucleic acid molecule would be a molecule containing 
adenine in the place of thymine, thymine in the place of adenine, cytosine in the place of 
guanine, and guanine in the place of cytosine. Because the complement can contain a 
nucleic acid sequence that forms optimal interactions with the parent nucleic acid 
molecule, such a complement can bind with high affinity to its parent molecule. 

30 The term "hybridize" refers to a method of interacting a nucleic acid sequence with 

a DNA or RNA molecule in solution or on a solid support, such as cellulose or 
nitrocellulose. If a nucleic acid sequence binds to the DNA or RNA molecule with high 
affinity, it is said to "hybridize" to the DNA or RNA molecule. The strength of the 
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interaction between the probing sequence and its target can be assessed by varying the 
stringency of the hybridization conditions. Under highly stringent hybridization 
conditions, only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having one or two mismatches out of 20 

5 contiguous nucleotides. 

By "conserved nucleic acid regions", it is meant regions present on two or more 
nucleic acids encoding an hOAT polypeptide, to which a particular nucleic acid sequence 
can hybridize under lower stringency conditions. Examples of lower stringency 
conditions suitable for screening for nucleic acids encoding hOAT polypeptides are 
10 provided in Abe, et al. (1992) J. Biol Chem. 19:13361 (hereby incorporated by reference 
herein in its entirety, including any drawings). Preferably, conserved regions differ by no 
more than 5 out of 20 contiguous nucleotides. 

By 4l unique nucleic acid region" is meant a sequence present in a full-length 
nucleic acid coding for an hOAT polypeptide that is not present in a sequence coding for 
1 5 any other known naturally occurring polypeptide. Such regions preferably comprise 1 2 or 
more contiguous nucleotides present in the full-length nucleic acid encoding an hOAT 
polypeptide. In particular, a unique nucleic acid region is preferably of human origin. 

The term "nucleic acid vector" relates to a single or double stranded circular 
nucleic acid molecule that can be transfected or transformed into cells and replicate 
20 independently or within the host cell genome. 

The terms "trans formation** and ct transfection" refer to methods of inserting an 

expression construct into a cellular organism. 

The term "promoter element" describes a nucleotide sequence that is incorporated 
into a vector that, once inside an appropriate cell, can facilitate transcription factor and/or 
25 polymerase binding and subsequent transcription of portions of the vector DNA into 

mRNA. The promoter element precedes the 5' end of the hOAT polypeptide nucleic acid 
molecule such that the latter is transcribed into mRNA. Host cell machinery then 
translates mRNA into a polypeptide. 

By an "hOAT polypeptide" is meant the full-length amino acid sequence of SEQ 
30 ID NO:7, SEQ ID NO:8, SEQ ID NO:9. SEQ ID NO:10, SEQ ID NO:l U SEQ ID NO:12, 
or a functional derivative thereof. 

The term "recombinant celP refers to a cell that has been altered to contain a new 

combination of genes or nucleic acid molecules. 
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The term "recombinant organism" refers to an organism that has a new 
combination of genes or nucleic acid molecules. 

The term "organism" relates to any living being comprised of at least one cell. An 
organism can be as simple as one eukaryotic cell or as complex as a mammal. Therefore, 
5 a recombinant organism can also be a recombinant cell, which may be a eukaryotic or a 

prokaryotic organism. 

By "recombinant hOAT polypeptide" it is meant to include a polypeptide produced 

by recombinant DNA techniques such that it is distinct from a naturally occurring 

polypeptide either in its location {e.g., present in a different cell or tissue than found in 
1 0 nature), purity or structure. Generally, such a recombinant polypeptide will be present in a 

cell in an amount different from that normally observed in nature. 

The term "eukaryote" refers to an organism comprised of cells that contain a 

nucleus. Eukaryotes are differentiated from "prokaryotes" which do not have a nucleus 

and lack other cellular structures found in eukaryotes, such as mitochondria and 
1 5 endoplasmic reticulum. Prokaryotes include unicellular organisms, such as bacteria, while 

eukaryotes are represented by yeast, invertebrates, and vertebrates. 

The term "extragenomic" refers to a nucleic acid vector which does not insert into 

the cell genome. Thus, these vectors replicate independently of the host genome and do 

not recombine with or integrate into the genome. 
20 The term "intragenomic" defines a nucleic acid construct that is incorporated 

within the cell genome. 

By "isolated" in reference to a polypeptide is meant a polymer of 6, 12, 18 or more 
amino acids conjugated to each other, including polypeptides that are isolated from a 
natural source or that are synthesized. The isolated polypeptides of the present invention 
25 are unique in the sense that they are not found in a pure or separated state in nature. Use 
of the term "isolated" indicates that a naturally occurring sequence has been removed from 
its normal cellular environment. Thus, the sequence may be in a cell-free solution or 
placed in a different cellular environment. The term does not imply that the sequence is 
the only amino acid chain present, but that it is essentially free (about 90-95% pure at 

30 least) of material naturally associated with it. 

By the use of the term "enriched" in reference to a polypeptide it is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2-5 fold) of the 
total of amino acid sequences present in the cells or solution of interest than in normal or 
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diseased cells or in the cells from which the sequence was taken. This could be caused by 
a preferential reduction in the amount of other amino acid sequences present, or by a 
preferential increase in the amount of the specific amino acid sequence of interest, or by a 
combination of the two. However, it should be noted that "enriched" does not imply that 
5 there are no other amino acid sequences present, just that the relative amount of the 
sequence of interest has been significantly increased. 

The term "significant'' here is used to indicate that the level of increase is useful to 
the person making such an increase, and generally means an increase relative to other 
amino acid sequences of about at least 2 fold, more preferably at least 5 to 1 0 fold or even 

1 0 more. The term also does not imply that there are no amino acid sequences from other 
sources. The other source amino acid sequences may, for example, comprise amino acid 
sequences encoded by a yeast or bacterial genome, or a cloning vector such as pUC19. 
The term is meant to cover only those situations in which a person has intervened to 
elevate the proportion of the desired amino acid sequences. 

15 The term "purified" in reference to a polypeptide does not require absolute purity 

(such as a homogeneous preparation); instead, it represents an indication that the sequence 
is relatively purer than in the natural environment (compared to the natural level this level 
should be at least 2-5 fold greater, e.g., in terms of mg/mL). 

By "an hOAT polypeptide fragment" it is meant an amino acid sequence that is 

20 less than the full-length hOAT amino acid sequences (and which excludes the listed EST 
sequences) encoded by the nucleic acid sequence set forth in SEQ ID NO:l (excluding 
EST clone R25797), SEQ ID NO:2 (excluding EST clone Al 01 6020), SEQ ID NO:3 
(excluding EST clone A 1016020), SEQ ID NO:4 (excluding EST clone H41333), SEQ ID 
NO:5, or SEQ ED NO: 6 (excluding EST clone AA705512) or the amino acid sequence 

25 (excluding the amino acids encoded by the EST clones previously listed set forth in SEQ 
ID NO:7, SEQ ID NO:8; SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO: 11, or SEQ ID 
NO: 12. Thus, the term "fragment" is used to indicate a polypeptide derived from the 
amino acid sequence of the hOAT polypeptides, of the complexes having a length less 
than the full-length polypeptide from which it has been derived. 

30 By "an hOAT domain" it is meant a portion of the hOAT polypeptide having 

homology to amino acid sequences from one or more known proteins wherein the 
sequence predicts some common function, interaction or activity. 

20 



NSDOCID:<WO 0104283A3 IA> 



WO 01/04283 PCT7US00/18980 

By "hOAT polypeptide analog*' it is meant an amino acid sequence substantially 
similar to the sequence encoded by the nucleic acid sequence set forth in SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO:3, SEQ ED NO:4, SEQ ID NO:5. or SEQ ID NO:6, or the 
amino acid sequence set forth in SEQ ID NO:7, or SEQ ID NO:8; SEQ ID NO:9, SEQ ID 
5 NO: 10, SEQ ID NO: 1 1, or SEQ ID NO: 12. 

By "biological activity 7 * it is meant an activity of the hOAT polypeptides in a cell. 
The biological activity of the hOAT polypeptides is related to some of the activities of the 
cell which include, but are not limited to, transport (uptake or excretion of a substrate from 
a cell), cell proliferation, mitogenesis, metastasis, tumor escape, cell adhesion, 

1 0 transformation, or apoptosis. 

By "identity" is meant a property of sequences that measures their similarity or 

relationship. Identity is measured by dividing the number of identical residues in the two 

sequences by the total number of residues and multiplying the product by 100. Thus, two 

copies of exactly the same sequence have 100% identity, but sequences that are less highly 
1 5 conserved and have deletions, additions, or replacements have a lower degree of identity. 

Those skilled in the art will recognize that several computer programs are available for 

determining sequence identity. 

An hOAT polypeptide analog may differ from the native sequence of an hOAT 

polypeptide in that one or more amino acids have been changed, added or deleted. 
20 Changes in amino acids may be conservative or non-conservative. By "conservative" it is 

meant the substitution of an amino acid for one with similar properties such as charge, 

hydrophobicity, structure, etc. 

By "hOAT-specific epitope" it is meant a sequence of amino acids that is both 

antigenic and unique to hOAT. 
25 By "specific-binding affinity" is meant that the antibody binds to target (hOAT) 

polypeptides with greater affinity than it binds to other polypeptides under specified 
conditions. 

The term "polyclonal" refers to antibodies that are heterogenous populations of 
antibody molecules derived from the sera of animals immunized with an antigen or an 

30 antigenic functional derivative thereof. 

"Monoclonal antibodies" are substantially homogenous populations of antibodies 

to a particular antigen. 
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The term "antibody fragment" refers to a portion of an antibody, often the 
hypervariable region and portions of the surrounding heavy and light chains, that displays 
specific binding affinity for a particular molecule. A hypervariable region is a portion of 
an antibody that physically binds to the polypeptide target. 
5 By "hybridoma" is meant an immortalized cell line which is capable of secreting 

an antibody, for example an hOAT antibody. 

By "natural-binding partner" it is meant a protein, organic anion, or other molecule 

that interacts with an hOAT polypeptide. 

As used herein, pharmacokinetics is the process by which a drug or compound is 
1 0 absorbed, distributed, metabolized and eliminated by the body. 

The term "compound" includes small organic or inorganic molecules of molecular 
weight of preferably less than 1000 atomic units, more preferably less than 800 atomic 
units, and most preferably less than 500 atomic units. "Organic molecules" include all 
molecules that contain a carbon atom, whereas "inorganic molecules" are those that do not 

1 5 have a carbon atom. 

The term "function" refers to the cellular role of a protein. The role of the proteins 
of the invention may include transport of substrates into and out of a cell, involvement in 
cascades controlling cell growth, migration, differentiation, gene expression, muscle 
contraction, glucose metabolism, cellular protein synthesis, and regulation of the cell 

20 cycle. 

The term "modulates" refers to the ability of a compound to alter the function of an 
hOAT polypeptide. A modulator preferably activates or decreases the transporter activity 
of an hOAT protein. A modulator that increases the transporting activity is a positive 
modulator; and one that decreases the transporting activity is a negative modulator. The 

25 term "modulates" also refers to altering the function of a protein by increasing or 

decreasing the probability that a complex, i.e. an assembly of at least two molecules bound 
to one another, forms between an hOAT protein and a natural-binding partner. 

The term "transporting activity," in the context of the invention, defines the ability 
of a transporter polypeptide to uptake a substrate into a cell or efflux a molecule out of a 

30 cell. 

The term "substrate" as used herein refers to a molecule that is transported into or 
out of a cell by an OAT polypeptide. The substrate may be an organic compound or 
molecule, inorganic compound or molecule, a peptide, or a protein. 
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The term "activates" refers to increasing the transport or efflux of a molecule into 
or out of a cell. 

The term "inhibitor" refers to a compound or substance that binds to a substrate- 
binding site and, thereby, decreases or prevents transport of an hOAT substrate. 
5 The term "expressing" as used herein refers to the production of an hOAT 

polypeptide from a nucleic acid vector containing an hOAT gene within a cell. 

The term "adding" as used herein refers to administering a solution comprising a 

compound to the medium bathing cells. 

The term "functional derivative" with respect to a polypeptide is a polypeptide that 

1 0 possesses a biological activity (either functional or structural) or an immunological 
characteristic that is substantially similar to a biological activity or an immunological 
characteristic of a non-recombinant hOAT. A functional derivative of an hOAT 
polypeptide may or may not contain post-translational modifications such as covalently 
linked carbohydrate, depending on the necessity of such modifications for the performance 

15 of a specific function. The term "functional derivative" is intended to include the 
"fragments", "variants", "analogues", "homologues" or chemical derivatives of a 
molecule. 

Similarly, a "functional derivative" of a gene encoding an hOAT polypeptide of 
the present invention includes "fragments", variants", or "analogues" of the gene, which 
20 may be "substantially similar" in nucleotide sequence, and which encode a molecule 
possessing similar activity to an hO AT polypeptide or fragment thereof. Permutations 
resulting from degeneracy of the genetic code are also considered functional derivatives. 

A molecule is said to be "substantially similar" to another molecule if the sequence 
of amino acids in both molecules is substantially the same. Substantially similar amino 
25 acid molecules will possess a similar biological activity. Thus, provided that two 
molecules possess a similar activity, they are considered variants as that term is used 
herein even if one of the molecules contains additional amino acid residues not found in 
the other, or if the sequence of amino acid residues is not identical. 

A "chemical derivative" of a polypeptide contains additional chemical moieties not 
30 normally a part of the polypeptide. 

The term "mammalian" refers to such organisms as mice, rats, rabbits, goats, more 
preferably monkeys and apes, and most preferably humans. 
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Brief Description of the Figures 

FIG. 1 shows the cDNA and deduced amino acid sequences of human OAT1. 

FIG. 2 shows the cDNA and deduced amino acid sequences of human OAT2A. 

FIG. 3 shows the cDNA and deduced amino acid sequences of human OAT2B. 
5 FIG. 4 shows the cDNA and deduced amino acid sequences of human OAT3. 

FIG. 5 shows the cDNA and deduced amino acid sequences of human OAT4. 

FIG. 6 shows the cDNA and deduced amino acid sequences of human OATS. 

FIG. 7 shows the amount of 3 H-PAH taken up by HeLa cells transfected with 
increasing amounts of hOATl or vector control (LacZ) cDNAs. 
10 FIG. 8 shows that oocytes injected with hOATl RNA uptake 3 H-PAH to a much 

greater extent than water injected oocytes. 

FIG. 9 shows that ECR293 cells stably containing hOATl, in plasmid pIND(spl), 
uptake approximately 50-fold more 3 H-PAH when induced by ponasterone than non- 
induced cells. 

15 FIG. 10 shows that ECR293 cells stably containing hOAT2A, in plasmid 

pIND(spl), uptake approximately 2. 6- fold more 3 H-PAH when induced by ponasterone 
than non-induced cells. 

FIG. 1 1 shows that sulfobromophthalein but not a-ketoglutarate competed with 
3 H-PAH uptake in ECR293/OAT2A cells induced with ponasterone. 
20 FIG. 12 shows that oocytes injected with hOAT3 RNA uptake 3 H-PAH to a much 

greater extent than water injected oocytes. 

FIG. 13 (A-E) shows Northern Blot analyses of hOATl -5 mRNA in various 
human tissues. 
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DETAILED DESCRIPTION OF THE INVENTION 

30 A. Introduction 

Organic anions are carbon-based molecules with one or more negative charges at 
physiological pH. They can be endogenous compounds such as bile acids or xenobiotics 
such as p-aminohippurate. Organic anion transporters are membrane proteins that 
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facilitate the transport of organic anions across the cell membrane. The family described 
herein consists of proteins sharing at least 50% amino acid sequence similarity among 
each other. They show broad substrate specificity, meaning that each member can 
transport a variety of organic anions. Hence, they are often referred to as multi-specific 

5 organic anion transporters. The mechanism of transport, though not currently elucidated, 
probably is a secondary or tertiary active transport involving exchange of another organic 
anion. An example of a tertiary transport system is the rat kidney OAT1 transporter. The 
primary transport system is the Na + /K + -ATPase, which establishes the high out-to-in Na* 
gradient. This Na* gradient then drives the cotransport of Na + and dicarboxylates such as 

10 a-ketoglutarate through a secondary NaVdicarboxylate transporter. Finally, the tertiary 
transport system is established by OAT1 , which exchanges external PAH for intracellular 
a-ketoglutarate. 

The liver and kidney are two organs rich in organic anion transporters. 
Hepatocytes and epithelial cells of the kidney proximal tubules are polarized cells with a 

15 basolateral and apical membrane. For the hepatocyte, the basolateral membrane faces the 
blood-filled sinusoid. Thus, the basolateral membrane is also called the sinusoidal 
membrane. The apical membrane faces the canaliculus, into which bile is secreted. The 
hepatocyte thus can transport organic anions from the blood into the bile canaliculus 
across the basolateral and apical membranes. Similarly, cells of the kidney proximal 

20 tubule transport organic anions from the blood into urine across their basolateral and 
apical membranes. Examples of liver sinusoidal transporters are the oatpl and oatp2, 
whereas OAT1 is a kidney basolateral transporter. 



B. General Method 

25 The practice of the present invention employs conventional techniques of 

molecular and cell biology, microbiology, and recombinant DNA. Most techniques are 
well-described in J. Sambrook et al., "Molecular Cloning; A Laboratory Manual" (1989) 
and J. Celis, ed. "Cell Biology: A Laboratory Handbook" (1998). 



30 Identification of hOAT Nucleic Acid Molecules 

Extensive searches of Genbank databases were performed using database tools 
available through the website maintained by the National Center for Biotechnology 
Information (http://www.ncbi.nlm.nih.gov/). In particular, the Basic Local Alignment 
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Search Tool (BLAST) was used to perform sequence similarity searches of the Genbank 
databases, such as the database containing human expressed sequence tags (ESTs). The 
EST database contains sequences of randomly selected clones from a variety of human 
cDNA libraries. Since each EST clone was sequenced with one primer, about 300-500 bp 
5 of sequence information is available from each clone. The BLAST programs can translate 
these sequences in all possible reading frames and compare them to an input protein 
sequence. Human ESTs showing homology to rat OAT proteins where hereby identified. 

Identified EST clones can be obtained from licensed distributors of the I.M.A.G.E. 
Consortium Lawrence Livermore National Laboratory (LLNL) cDNA clones. The clones 

10 are then sequenced to verify their homologies to published sequences. DNA sequencing is 
usually done by Sanger's method of using dideoxynucleotide triphosphates in a primer 
extension reaction. The DNA to be sequenced, called the template, is first denatured and 
then annealed to a specific primer, usually between 15-25 basepairs. In one method, 35 S- 
adenosine triphosphate is incorporated into the growing DNA strand by a DNA 

15 polymerase. Materials for such a reaction are available from commercial kits such as 
Sequenase 2.0® from Amersham. Reaction products are separated on a long 
polyacrylamide gel, usually 6.0%, by electrophoresis using Tris-borate buffer. After 
electrophoresis, the gel is dried under vacuum and then exposed to autoradiographic film. 
The DNA sequence can be read from the autorad image and entered into computer for 

20 sequence analysis. Alternatively, fluorescent-tagged nucleotides can be incorporated by 
the polymerase chain reaction (PGR) into the template DNA using the thermal-stable 
polymerase from Ttiermus aquaticus (Tag polymerase). The PCR products can then be 
loaded onto an automated sequencer such as the ABI 377 (PE Applied Biosystems, 
Foster City, CA), which separates the products and determines the sequence. 

25 Isolation of cDNA clones from a particular tissue requires the construction of a 

cDNA library representing messages of that tissue. Library construction entails the 
isolation of mRNA from the source tissue and usually a fractionation step to separate low 
(< 0.5 kb) and high molecular weight species. The mRNAs are then primed with a DNA 
oligonucleotide, either a poly-thymidine or a mixture of random primers. Reverse 

30 transcriptase is then added to synthesize the complementary DNA strand. Short DNA 
fragments with sticky ends, called adapters, are ligated to the cDNAs so they can be 
subcloned into an appropriate vector. Popular vectors include bacteriophages such as 
Xgil 1 or XZAP (Stratagene, La Jolla, CA). For phage libraries, the cDNAs are ligated to 
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lambda phage arms at the adapter site and then packaged into phage particles. Once 
packaged, the library can then be amplified by infecting an appropriate bacterial host. 
Ready-made libraries of various human tissues are available from several commercial 
sources. 

5 To isolate a particular gene of interest from the cDNA library, an aliquot is plated 

out onto large plates and then transferred to nitrocellulose or nylon membrane. The phage 
DNA attached to the membrane is denatured in alkali solution, neutralized, and then 
crosslinked to the filter by baking or UV irradiation. The filter is then hybridized with a 
probe of interest. The probe can be an oligonucleotide or a longer, double-stranded 

10 fragment of DNA, such as the insert of an EST clone. The double-stranded probe is 

usually labeled with 32 P to high specific activity (>10 9 dpm/>ig), denatured by boiling for 2 
minutes, and added to a hybridization buffer to concentration of 10 dpm/ml. 
Hybridization buffers can vary in composition depending on the hybridization condition 
desired. One common formulation contains 0.9 M NaCl, 90 mM sodium citrate (pH 7.0), 

15 10 mM EDTA (pH 8.0), 0.1% (w/v) Ficoll, 0.1% (w/v) polyvinylpyrrolidone, 0.1 (w/v) 
BSA, 0.5 % SDS, and 100 |ig/ml sheared, denatured salmon sperm DNA. Other 
formulations are available from commercial sources. The filters are hybridized with the 
probe at 65°C overnight or at least 1 6 hours. They are then washed with sodium citrate 
solutions with decreasing ionic strength and increasing temperature. Finally, they are 

20 exposed to autoradiographic film with an intensifying screen at -80°C overnight. 

If a plaque contains DNA sequences complementary to the probe, the probe will 
hybridize to the complementary sequences on the filter. The plaque containing the 
complementary sequences is detected by exposing X-ray film to the filter. The plaque of 
interest can then be isolated and purified by conventional methods. Furthermore, once 

25 purified, the phage DNA can be manipulated so the relevant cDNA insert is excised from 
the phage into a phagmid by an in vivo excision procedure using appropriate host cells. 
The phagmids can then be amplified in bacteria and isolated by conventional methods 
such as alkaline lysis. Once the plasmid is purified, it can be analyzed by standard means 
such as restriction endonuclease or PCR mapping and DNA sequencing. DNA sequence 

30 information is analyzed for open reading frames and compared to published sequences. 
As discussed earlier, sets of computer programs for DNA and protein sequence analysis 
are commercially available. 



27 



3NSDOCID: <WO 01 04283 A3 )A> 



WO 01/04283 



PCTYUS00/18980 



Once a cDNA coding for a full-length transporter protein is isolated, it can be 
excised from its cloning vector and subcloned into an appropriate expression vector. 
Subcloning is a frequently used procedure in recombinant DNA technology. It involves 
the digestion of DNA by restriction endonucleases, fractionation of the DNA fragments by 

5 agarose gel electrophoresis, and purification of the desired fragment by a DNA-binding 
matrix. The purified fragment is then ligated to a precut vector of choice by adding an 
enzyme such as T4 DNA ligase. The new DNA construct can then be transfected into a 
mammalian cell line to test for functional expression. Many types of expression plasmids 
are commercially available and they all share common elements. An expression plasmid 

10 usually contains a bacterial origin of replication, a DNA promoter that is active in a 

mammalian cell line, a multiple cloning site, a transcriptional termination element, and a 
selective marker that confers resistance to an antibiotic. An expression plasmid harboring 
the cDNA of interest can be transfected into mammalian cells by several conventional 
means like calcium phosphate precipitation or lipofection. Many transfection kits are 

1 5 commercially available. 

Recombinant Cells Containing a Nucleic Acid Molecule E ncoding an hOAT 

Cell lines which stably express hOAT polypeptides can be isolated. Such cells 
may be obtained by using expression vectors which carry nucleic acid sequences which 

20 encode hOAT polypeptides. The expression vectors may contain viral origins of 

replication and/or endogenous expression elements. Further, such expression vectors may 
carry nucleic acid sequences which encode for a selectable marker. 

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a 
polypeptide if it contains nucleotide sequences which contain transcriptional and 

25 translational regulatory information and such sequences are "operably linked" to 

nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in 
which the regulatory DNA sequences and the DNA sequence sought to be expressed are 
connected in such a way as to permit gene sequence expression. The precise nature of the 
regulatory regions needed for gene sequence expression may vary from organism to 

30 organism, but shall in general include a promoter region which, in prokaryotes, contains 
both the promoter (which directs the initiation of RNA transcription) as well as the DNA 
sequences which, when transcribed into RNA, will signal synthesis initiation. Such 
regions will normally include those S'-non-coding sequences involved with initiation of 
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transcription and translation, such as the TATA box. capping sequence, CAAT sequence, 
and the like. 

If desired, the non-coding region 3' to the sequence encoding an hOAT 
polypeptide of the invention may be retained for its transcriptional termination regulatory 

5 sequences, such as termination and polyadenylation. Thus, by retaining the 3'-region 
naturally contiguous to the DNA sequence encoding an hOAT polypeptide of the 
invention, the transcriptional termination signals may be provided. Where the 
transcriptional termination signals are not satisfactorily functional in the expression host 
cell, then a 3' region functional in the host cell may be substituted. 

10 Two DNA sequences (such as a promoter region sequence and a sequence 

encoding an hOAT polypeptide of the invention) are said to be operably linked if the 
nature of the linkage between the two DNA sequences does not (1) result in the 
introduction of a frame-shift mutation, (2) interfere with the ability of the promoter region 
sequence to direct the transcription of a gene sequence encoding an hOAT polypeptide of 

1 5 the invention, or (3) interfere with the ability of the gene sequence of an hOAT 

polypeptide of the invention to be transcribed by the promoter region sequence. Thus, a 
promoter region would be operably linked to a DNA sequence if the promoter were 
capable of effecting transcription of that DNA sequence. Thus, to express a gene 
encoding an hOAT polypeptide of the invention, transcriptional and translational signals 

20 recognized by an appropriate host are necessary. 

The present invention encompasses the expression of a gene encoding an hOAT 
polypeptide of the invention (or a functional derivative thereof) in either prokaryotic or 
eukaryotic cells. Prokaryotic hosts are, generally, very efficient and convenient for the 
production of recombinant proteins and for the propagation of clones. Prokaryotes most 

25 frequently are represented by various strains of E. coli. However, other microbial strains 
may also be used, including other bacterial strains. 

In prokaryotic systems, plasmid vectors that contain replication sites and control 
sequences derived from a species compatible with the host may be used. Examples of 
suitable plasmid vectors may include pBR322, pUCl 18, pUCl 19 and the like; suitable 

30 phage or bacteriophage vectors may include XgtlO, Xgtl 1 and the like; and suitable virus 
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the 
present invention has the capacity to replicate in the selected host cell. 
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Recognized prokaryotic hosts include bacteria such as E. colU Bacillus, 
Streptomyces* Pseudomonas, Salmonella, Serratia, and the like. However, under such 
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be 
compatible with the replicon and control sequences in the expression plasmid. 
5 To express an hOAT polypeptide of the invention (or a functional derivative 

thereof) in a prokaryotic cell, it is necessary to operably link the sequence encoding the 
hOAT polypeptide of the invention to a functional prokaryotic promoter. Such promoters 
may be either constitutive or, more preferably, regulatable (i.e., inducible or 
derepressible). Examples of constitutive promoters include the int promoter of 

1 0 bacteriophage X, the bla promoter of the p-lactamase gene sequence of pBR322, and the 
cat promoter of the chloramphenicol acetyl transferase gene sequence of pPR325, and the 
like. Examples of inducible prokaryotic promoters include the major right and left 
promoters of bacteriophage X (P L and Pr), the trp, recA, AacZ. Xacl 9 and gal promoters of 
E. colU the a-amylase (Ulmanen et a/., (1985) J. BacterioL 162:176-182), and the 

1 5 promoters of the bacteriophages of Bacillus (Gryczan, In: The Molecular Biology of the 
Bacilli, Academic Press, Inc., NY, 1982), and Streptomyces promoters (Ward et ah y 
(1986) Mol Gen. Genet. 203:468-478). Prokaryotic promoters are reviewed by Glick 
(1987, Ind. MicrobioU 1:277-282), Cenatiempo (1986, Biochimie 68:505-516), and 
Gottesman(1984,^««. Rev. Genet. 18:415-442). 

20 Proper expression in a prokaryotic cell also requires the presence of a ribosome- 

binding site upstream of the gene sequence-encoding sequence. Such ribosome-binding 
sites are disclosed, for example, by Gold et aL (1981, Ann. Rev. Microbiol. 35:365-404). 
The selection of control sequences, expression vectors, transformation methods, and the 
like, are dependent on the type of host cell used to express the gene. As used herein, 

25 "cell", "cell line", and "cell culture" may be used interchangeably and all such 

designations include progeny. Thus, the words ll transformants M or 'transformed cells" 
include the primary subject cell and cultures derived therefrom, without regard to the 
number of transfers. It is also understood that all progeny may not be precisely identical 
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant 

30 progeny have the same functionality as that of the originally transformed cell. 

Host cells which may be used in the expression systems of the present invention 
are not strictly limited, provided that they are suitable for use in the expression of the 
hOAT polypeptide of interest. Especially suitable hosts for expressing the polypeptides of 
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the invention are eukaryotic cells. Preferred eukaryotic hosts include, for example, yeast, 
fungi, insect cells, amphibian oocytes, mammalian cells either in vivo, or in tissue culture. 
Mammalian cells which may be useful as hosts include 3T3-L1 preadipocytes, cells of 
fibroblast origin such as VERO or CHO-K1, or cells of lymphoid origin and their 

5 derivatives. Preferred mammalian host cells include HeLa, EcR293, SP2/0 and J558L, as 
well as neuroblastoma cell lines such as IMR 332, which may provide better capacities for 
correct post-translational processing. 

In addition, plant cells are also available as hosts, and control sequences 
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and 

10 1 9S, and nopaline synthase promoter and polyadenylation signal sequences. Another 
preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as 
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, (1988) Science 
240:1453-1459)- Alternatively, baculovirus vectors can be engineered to express large 
amounts of hOAT polypeptides of the invention in insect cells (Jasny, (1987) Science 

15 238: 1653; Miller et al, In: Genetic Engineering, Vol. 8, Plenum, Setlow et al, eds., pp. 
277-297, 1986). 

Any of a series of yeast expression systems can be utilized which incorporate 
promoter and termination elements from the actively expressed sequences coding for 
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums 

20 rich in glucose. Known glycolytic gene sequences can also provide very efficient 

transcriptional control signals. Yeast provides substantial advantages in that it can also 
carry out post-translational modifications. A number of recombinant DNA strategies exist 
utilizing strong promoter sequences and high copy number plasmids which can be utilized 
for production of the desired proteins in yeast. Yeast recognizes leader sequences on 

25 cloned mammalian genes and secretes peptides bearing leader sequences («.»., pro- 
peptides). Several possible vector systems are available for the expression of hOAT 
polypeptides of the invention in a mammalian host. 

A wide variety of transcriptional and translational regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 

30 regulatory signals may be derived from viral sources, such as adenovirus, bovine 

papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals 
are associated with a particular gene sequence which has a high level of expression. 
Alternatively, promoters from mammalian expression products, such as actin, collagen, 
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myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
5 are subject to chemical (such as metabolite) regulation. 

Expression of hOAT polypeptides of the invention in eukaryotic hosts requires the 
use of eukaryotic regulatory regions. Such regions will, in general, include a promoter 
region sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters 
include, for example, the promoter from the immediate-early gene of human 

10 cytomegalovirus (CMV) (Boshart M. et al., (1985) C<?// 2:521-30, the promoter of the 
mouse metallothionein I gene sequence (Hamer et al 9 (1982) 7. Mol Appl Gen. 1:273- 
288); the TK promoter of Herpes virus (McKnight, (1982) Cell 31 :355-365); the SV40 
early promoter (Benoist et al., (1981) Nature (London) 290:304-31); and the yeast gal4 
gene sequence promoter (Johnston et a/., (1982) Proc. Natl Acad. Set (USA) 79:6971- 

15 6975; Silver et al. % (1984) Proc. Natl Acad. Set (USA) 81:5951-5955). 

Translation of eukaryotic mRNA is initiated at the codon which encodes the first 
methionine. For this reason, it is preferable to ensure that the linkage between a 
eukaryotic promoter and a DNA sequence which encodes an hOAT polypeptide of the 
invention (or a functional derivative thereof) does not contain any intervening codons 

20 which are capable of encoding a methionine (/.<?., AUG). The presence of such codons 

results either in the formation of a fusion protein (if the AUG codon is in the same reading 
frame as the hOAT polypeptide of the invention coding sequence) or a frame-shift 
mutation (if the AUG codon is not in the same reading frame as the hOAT polypeptide of 
the invention coding sequence). 

25 A nucleic acid molecule encoding an hOAT polypeptide of the invention and an 

operably linked promoter may be introduced into a recipient prokaryotic or eukaryotic cell 
either as a nonreplicating DNA or RNA molecule, which may either be a linear molecule 
or, more preferably, a closed covalent circular molecule. Since such molecules are 
incapable of autonomous replication, the expression of the gene may occur through the 

30 transient expression of the introduced sequence. Alternatively, permanent expression may 
occur through the integration of the introduced DNA sequence into the host chromosome. 

A vector may be employed which is capable of integrating the desired gene 
sequences into the host cell chromosome. Cells which have stably integrated the 
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introduced DNA into their chromosomes can be selected by also introducing one or more 
markers which allow for selection of host cells which contain the expression vector. The 
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g., 
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene 

5 sequence can either be directly linked to the DNA gene sequences to be expressed, or 

introduced into the same cell by co-transfection. Additional elements may also be needed 
for optimal synthesis of mRNA. These elements may include splice signals, as well as 
transcription promoters, enhancers, and termination signals. cDNA expression vectors 
incorporating such elements include those described by Okayama (1983, Mol. Cell. Biol. 

10 3:280-289). 

The introduced nucleic acid molecule can be incorporated into a plasmid or viral 
vector capable of autonomous replication in the recipient host. Any of a wide variety of 
vectors may be employed for this purpose. Factors of importance in selecting a particular 
plasmid or viral vector include: the ease with which recipient cells that contain the vector 

1 5 may be recognized and selected from those recipient cells which do not contain the vector, 
the number of copies of the vector which are desired in a particular host; and whether it is 
desirable to be able to "shuttle" the vector between host cells of different species. 

Preferred prokaryotic vectors include plasmids such as those capable of replication 
in£. coli (such as, for example, pBR322, ColEl, pSClOl, pACYC 184; "Molecular 

20 Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pC194, pC221, 
pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, 
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include plJlOl (Kendall et al., 
(1987) J. Bacteriol. 169:4177-4183), Pseudomonas plasmids are reviewed in John et al. 
(1986) Rev. Infect. Dis. 8:693-704), and Izaki (1978) J. Bacteriol. 33:729-742). 

25 Preferred eukaryotic plasmids include, for example, pcDNA3 . 1 (Invitrogen), BPV, 

vaccinia, SV40. 2-micron circle, and the like, or their derivatives. Such plasmids are well 
known in the art (Botstein et al, (1982) Miami Wntr. Symp. 19:265-274; Broach, In: "The 
Molecular Biology of the Yeast Saccharomyces: Life Cycle and Inheritance". Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY, p. 445-470, 1981; Broach, (1982) Cell 

30 28:203-204; Bollon et al., (1980) J. Clin. Hematol. Oncol. 10:39-48; Maniatis, In: Cell 
Biology: A Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, 
NY, pp. 563-608, 1980). 
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Once the vector or nucleic acid molecule containing the construct(s) has been 
prepared for expression, the DNA construct(s) may be introduced into an appropriate Host 
cell by any of a variety of suitable means, i.e., transformation, transfection, conjugation, 
protoplast fusion, electroporation, particle gun technology, calcium phosphate- 

5 precipitation, direct microinjection, and the like. After the introduction of the vector, 
recipient cells are grown in a selective medium, which selects for the growth of vector- 
containing cells. Expression of the cloned gene(s) results in the production of an hOAT 
polypeptide of the invention, or fragments thereof. This can take place in the transformed 
cells as such, or following the induction of these cells to differentiate (for example, by 

1 0 administration of bromodeoxyuracil to neuroblastoma cells or the like). A variety of 

incubation conditions can be used to form the peptide of the present invention. The most 

■ 

preferred conditions are those which mimic physiological conditions. 

Following the introduction of the vector, cells may be allowed to grow for 1-2 days 
in an enriched media before they are switched to selective media* The purpose of the 
1 5 selectable marker is to confer resistance to selection, and its presence allows growth and 

recovery of cells which successfully express the introduced sequences. Resistant clones of 
stably transformed cells maybe proliferated using tissue culture technique appropriate to 
the cell type. 

Cell lines stably expressing human organic anion transporters under the control of 
20 an inducible promoter can also be isolated. For inducible expression of human organic 
anion transporters, the ecdysone system from Invitrogen (San Diego, CA) was used. 
Ecdysone is a hormone that regulates metamorphosis in insects, such as the fruitfly 
Drosophila. Like other steroid hormones, ecdysone exerts its effects by binding to a 
nuclear receptor, the VgR. The hormone-receptor complex then binds to a sequence of 
25 DNA called the ecdysone response element (EcRE), and activates transcription from 
promoters containing these elements. Since mammalian cells do not express ecdysone 
receptors, they normally do not respond to the insect hormone. However, if a piece of 
DNA under the control of an ecdysone inducible promoter is transfected into a mammalian 
cell line, treating the cell line with ecdysone will induce expression of the said DNA. 
30 The cloned transporters can also be expressed in the amphibian oocytes, a widely 

used heterologous system for the expression of membrane proteins. Complementary RNA 
can be synthesized from purified cDNA in vitro if it is downstream of an appropriate 
promoter, such as the T3 or T7 phage promoter. The synthesized KNA are microinjected 
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into each frog oocyte, which translates the message and expresses the protein in its 
membranes. Xenopus oocytes can be obtained by surgical removal of ovaries from the 
frog Xenopus laevis. The ovary lobes are treated with collagenase to remove the vitalin 
layer. Oocytes can be maintained for up to two weeks in Barth solution ( 88 mM NaCl. 1 
5 mM KC1, 2.4 mM NaHC0 3 , 15 mM HEPES (pH 7.6). 0.7 mM CaCl 2 , 0.82 mM MgS0 4 
and 50 ug/ml gentamicin) at 16°C Typically, about 50 nl or 10-30 ng of RNA are 
injected into each oocyte. The injected oocytes are maintained at 16°C for 2-3 days and 
then subjected to transport assays. 
Assays for functional expression 
1 o Assaying for functional expression of transporters can be done using a labeled 

compound that is a known substrate of the transporter. For hOATl , p-arninohippurate 
(PAH) is a high affinity substrate that is rapidly transported. Transfected cells or injected 
oocytes are washed briefly with media or Barth buffer and then incubated with the labeled 
PAH. After the desired incubation time, the excess label is washed away by adding cold 
1 5 buffer. The cells are then solubilized with SDS and the amount of label taken up can be 

determined by scintillation counting. 

Other methods for measuring transport activity of organic anion transporters are 
well known in the art and are applicable to the present invention. For example, transport 
assays utilizing membrane vesicles was demonstrated by Yamazaki, et al., (1997) Drug 
20 Metabolism and Disposition, 1 123-1 129; and Pascolo, et al. (1998) Biochem J.331 :99-103. 

r oll ,,-n.c stshlv ^pressing hflAT polypepti d e »-H «» identify substrates, inhibitors 

and modulators of h OAT p olypeptides 

Cell lines stably expressing hOAT polypeptides can be used to screen for other 
25 substrates or inhibitors of hOAT polypeptides through use of a competition assay. If a test 
compound is well transported by an hOAT protein, it will compete with a substrate, 
known to be transported by a particular hOAT, for access to the transporter. Cells 
expressing hOAT proteins would then uptake less of the known substrate in the presence 

than in the absence of the competitor. 
30 If a test compound competes with transport of a known substrate, further testing is 

necessary to confirm that the test compound is a substrate or inhibitor. Such confirmation 
can be done using techniques known to those skilled in the art, for example, the 
intracellular concentration of the test compound can be measured using HPLC. If a test 
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compound, which competes with transport of a known substrate, is also found to 
accumulate inside a cell, this confirms that the test substrate is a substrate and not an 
inhibitor of the particular hOAT being tested. 

Modulators can be identified by adding a test compound to cell lines stably 

5 expressing hOAT polypeptides and measuring whether the addition of the test compound 
increases or decreases the transport of hOAT substrates. If a modulator is identified, 
which decreases or inhibits the transport of a substrate, further testing, using techniques 
known to those skilled in the art, is necessary to confirm that such modulators are not 
actually competitive inhibitors. One such technique involves adding more substrate in a 

1 0 competition assay to determine if said substrate can compete with the inhibitory effects of 
the putative modulator. If adding more substrate has no affect on the modulating activity 
then the modulator is not a competitive inhibitor. 

Nucleic Acid Probes. Methods, and Kits for Detection of the hOAT Polypeptides 
15 A nucleic acid probe of the present invention may be used to probe an appropriate 

chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be 
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 
20 Sambrook, Fritsch, & Maniatis, eds., 1 989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic 
acid probes having nucleotide sequences which correspond to N-terminal and C-terminal 
portions of the amino acid sequence of the polypeptide of interest. The synthesized 
nucleic acid probes may be used as primers in a polymerase chain reaction (PCR) earned 
25 out in accordance with recognized PCR techniques, essentially according to PCR 

Protocols, "A Guide to Methods and Applications", Academic Press, Michael, et aL 7 eds., 
1990, utilizing the appropriate chromosomal or cDNA library to obtain the fragment of the 
present invention. 

One skilled in the art can readily design such probes based on the sequence 
30 disclosed herein using methods of computer alignment and sequence analysis known in 
the art ("Molecular Cloning: A Laboratory Manual", 1989, supra). The hybridization 
probes of the present invention can be labeled by standard labeling techniques such as 
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with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence, 
and the like. After hybridization, the probes may be visualized using known methods. 

The nucleic acid probes of the present invention include RNA, as well as DNA 
probes, such probes being generated using techniques known in the art. The nucleic acid 
5 probe may be immobilized on a solid support. Examples of such solid supports include, 
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as 
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads. 
Techniques for coupling nucleic acid probes to such solid supports are well known in the 

art. 

1 0 The test samples suitable for nucleic acid probing methods of the present invention 

include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for 
preparing nucleic acid extracts of cells are well known in the art and can be readily 

1 5 adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample 
comprises (a) contacting the sample with the above-described nucleic acid probe under 
conditions such that hybridization occurs, and (b) detecting the presence of the probe 
bound to the nucleic acid molecule. One skilled in the art would select the nucleic acid 

20 probe according to techniques known in the art as described above. Samples to be tested 
include but should not be limited to RNA samples of human tissue. 

A kit for detecting the presence of nucleic acids of the invention in a sample 
comprises at least one container means having disposed therein the above-described 
nucleic acid probe. The kit may further comprise other containers comprising one or more 

25 of the following: wash reagents and reagents capable of detecting the presence of bound 
nucleic acid probe. Examples of detection reagents include, but are not limited to 
radiolabeled probes, enzymatic labeled probes (horseradish peroxidase, alkaline 
phosphatase), and affinity labeled probes (biotin, avidin, or streptavidin). Preferably, the 
kit further comprises instructions for use. 
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The present invention relates to an antibody having binding affinity to an hOAT 
polypeptide of the invention. The polypeptide may have the amino acid sequence encoded 
by the nucleic acid sequence set forth in SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ED NO:6, or a functional derivative thereof, or the 
5 amino acid sequence set forth in SEQ ID NO:7 or SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 1 0, SEQ ID NO:l 1 , SEQ ID NO: 12, or a functional derivative thereof, or at least 9 
contiguous amino acids thereof (preferably, at least 20, 30, 40, 50 or 60 contiguous amino 
acids thereof). 

The present invention also relates to an antibody having specific binding affinity to 
1 0 an hO AT polypeptide of the invention. Such an antibody may be isolated by comparing 
its binding affinity to an hOAT polypeptide of the invention with its binding affinity to 
other polypeptides. Those which bind selectively to an hOAT polypeptide of the 
invention would be chosen for use in methods requiring a distinction between an hOAT 
polypeptide of the invention and other polypeptides, including other polypeptides with 
1 5 similar amino acid sequences. Such methods could include, but should not be limited to, 
the analysis of altered hOAT polypeptide expression in tissue containing other 
polypeptides. 

The hOAT polypeptides of the present invention can be used to produce antibodies 
or hybridomas. One skilled in the art will recognize that if an antibody is desired, such a 
20 peptide could be generated as described herein and used as an immunogen. The antibodies 
of the present invention include monoclonal and polyclonal antibodies, as well fragments 
of these antibodies, and humanized forms. Humanized forms of the antibodies of the 
present invention may be generated using one of the procedures known in the art such as 

chimerization or CDR grafting. 
25 The present invention also relates to hybridomas that produce the above-described 

monoclonal antibodies, or binding fragment thereof. A hybridoma is an immortalized cell 

* 

line that is capable of secreting a specific monoclonal antibody. 

In general, techniques for preparing monoclonal antibodies and hybridomas are 
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory 
30 Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers, 
Amsterdam, The Netherlands, 1984; St. Groth et aL, (1980), J. Immunol. Methods 
35:1-21). Any animal (mouse, rabbit, and the like) which is known to produce antibodies 
can be immunized with the selected polypeptide. Methods for immunization are well 
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known in the art. Such methods include subcutaneous or intraperitoneal injection of the 
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for 
immunization will vary based on the animal that is immunized, the antigenicity of the 
polypeptide and the site of injection. 
5 The polypeptide may be modified or administered in an adjuvant in order to 

increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide 
are well known in the art. Such procedures include coupling the antigen with a 
heterologous protein (such as globulin or p-galactosidase) or through the inclusion of an 

adjuvant during immunization. 

10 For monoclonal antibodies, spleen cells from the immunized animals are removed, 

fused with myeloma cells, such as SP2/0-Agl4 myeloma cells, and allowed to become 
monoclonal antibody producing hybridoma cells. Any one of a number of methods well 
known in the art can be used to identify the hybridoma cell that produces an antibody with 
the desired characteristics. These include screening the hybridomas with an ELISA assay, 

1 5 western blot analysis, or radioimmunoassay (Lutz et a/., (1988) Exp. Cell Res. 1 75 : 1 09- 
124). Hybridomas secreting the desired antibodies are cloned and the class and subclass 
are determined using procedures known in the art (Campbell, (1984) "Monoclonal 
Antibody Technology: Laboratory Techniques in Biochemistry and Molecular Biology", 
supra). 

20 For polyclonal antibodies, antibody-containing antisera is isolated from the 

immunized animal and is screened for the presence of antibodies with the desired 
specificity using one of the above-described procedures. The above-described antibodies 
may be detectably labeled. Antibodies can be detectably labeled through the use of 
radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such 

25 as horse radish peroxidase, alkaline phosphatase, and the like) fluorescent labels (such as 
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for 
accomplishing such labeling are well-known in the art, for example, see Stemberger et a/., 
(1970)7. Histochenu Cytochem. 18:315; Bayers aU (1979) Afeth. Enzym. 62:308; Engval 
etaL, {1972) Immunol. 109:129; Goding, J. (1976) Immunol, hteth. 13:215). The labeled 

30 antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to 
identify cells or tissues that express a specific peptide. 

The above-described antibodies may also be immobilized on a solid support. 
Examples of such solid supports include plastics such as polycarbonate, complex 
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carbohydrates such as agarose and sepharose, acrylic resins such as polyacrylamide and 
latex beads. Techniques for coupling antibodies to such solid supports are well known in 
the art (Weir et a/., "Handbook of Experimental Immunology" 4th Ed., Blackwell 
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et al^ (1974) Meth. 
5 Enzym. 34, Academic Press, N.Y.,). The immobilized antibodies of the present invention 
can be used for in vitro, in vivo, and in situ assays as well as in immunochromotography. 

The present invention also encompasses a method of detecting an hOAT 
polypeptide in a sample, comprising: (a) contacting the sample with an above-described 
antibody, under conditions such that immunocomplexes form, and (b) detecting the 
10 presence of said antibody bound to the polypeptide. In detail, the methods comprise 

incubating a test sample with one or more of the antibodies of the present invention and 
assaying whether the antibody binds to the test sample. 

Conditions for incubating an antibody with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, 
15 and the type and nature of the antibody used in the assay. One skilled in the art will 

recognize that any one of the commonly available immunological assay formats (such as 
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion based Ouchterlony, 
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of 
the present invention. Examples of such assays can be found in Chard ("An Introduction 
20 to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam, 
The Netherlands, 1986), Bullock et at ('Techniques in Immunocytochemistry," Academic 
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), Tijssen ("Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1985). 
25 The immunological assay test samples of the present invention include cells, 

protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or 
urine. The test samples used in the above-described method will vary based on the assay 
format, nature of the detection method and the tissues, cells or extracts used as the sample 
to be assayed. Methods for preparing protein extracts or membrane extracts of cells are 
30 well known in the art and can readily be adapted in order to obtain a sample which is 
testable with the system utilized. 

A kit contains all the necessary reagents to carry out the previously described 
methods of detection. The kit may comprise: (i) a first container means containing an 
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above-described antibody, and (ii) second container means containing a conjugate 
comprising a binding partner of the antibody and a label. Preferably, the kit also contains 
instructions for use. In another preferred embodiment, the kit further comprises one or 
more other containers comprising one or more of the following; wash reagents and 
5 reagents capable of detecting the presence of bound antibodies. 

Examples of detection reagents include, but are not limited to, labeled secondary 
antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric, 
enzymatic, or antibody binding reagents that are capable of reacting with the labeled 
antibody. The compartmentalized kit may be as described above for nucleic acid probe 
1 0 kits. One skilled in the art will readily recognize that the antibodies described in the 

present invention can readily be incorporated into one of the established kit formats that 

are well known in the art. 

Examples 

Example 1 : Identification of human EST clones homologous to rat organic 
15 anion transporter proteins 

The cloning of rat kidney organic anion transporter OAT1 (Sekine, et aL, (1997) J. 
Biol Chem. 30: 18526-18529) and liver transporter OAT2 (Sekine, et al., (1998) FEBS 
Letters 429: 179-182) has been previously reported. The published amino acid sequences 
of OAT 1 and OAT2 from rat were used as input sequences to perform a BLAST search of 
20 human expressed sequence tags (EST) sequences deposited in Genbank. The BLAST 
search identified several sequences that show amino acid homology to rat OAT1 and 
OAT2. 

EST clones accession Nos. R25797 and AI0 1 6020 show high homology to rat 
OAT1 and OAT2, respectively. The clones isolated using EST clones R25797 and 
25 A1016020 were named hOATl and hOAT2, respectively. The other ESTs show lower 
homology to rat OAT1 and OAT2. These other EST clones have been named hOAT3, 
hOAT4, and hOAT5. The latter three clones are not considered human homologues to the 
rat OAT1 and OAT2. 
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Example 2 : Isolation of full-length human OATl cDNA 
The human EST clone R25797, listed in Example 1 , was purchased from a 
distributor of IMAGE Consortium (LLNL) cDNA Clones (Lennon, et al., Genomics 33: 
151-52 (1996)). EST cDNA clone R25797 was sequenced to verify its homology to rat 
5 OATl. The cDNA insert was excised by Noil and HindUl digestion and purified by 
agarose gel electrophoresis. These DNA fragments were then labeled with 32 P-a- 
deoxyadenosine triphosphate (3000 Ci/mmol) using the Prime-a-Gene labeling system 
(Promega, Madison, WI) to a specific activity greater than 1 x 10 9 dpm/jig. 

A human kidney cDNA library in pTripleEx vector (Clontech, Palo Alto, CA) was 

10 plated out on 10 cm-diameter plates at a density of 5 x 10 4 pfu/plate. The plates were 
incubated at 37°C for 6-8 hours until each plaque was about 0.3-0.5 mm in diameter. 
Plaque lifts were performed by placing a nylon membrane on each plate for 2 minutes. 
The nylon membrane was then lifted carefully from the plate and immersed in 1 .5 M NaCl 
and 0.5 M NaOH for 2 minutes to denature the attached DNA. The nylon was then placed 

15 in 1.5 M NaCl and 0.5 M Tris (pH 8.0) for neutralization. Finally, the membrane was 
quickly rinsed in 2X SSC (0.3 M NaCl and 30 mM sodium citrate). The attached DNA 
was crosslinked by UV irradiation. Filters were incubated in hybridization buffer with 
radioactive probes at a concentration of 1 x 10 6 dpm/ml in roller bottles at 65°C overnight 
The next day, the filters were washed with 2X SSC and 0.1% SDS for 15 minutes at 25°C 

20 for two times. They were then washed with 0.2X SSC and 0. 1 % SDS for 1 hour at 65°C. 
After the final wash, filters were exposed to autoradiographic film with an intensifying 
screen at -80°C overnight. Positive clones, which show up as dark spots on the autorad, 
were identified and a 3 mm-diameter agar plug was removed from the original plate. 

The positive clone was purified by subsequent rounds of hybridization until a well- 

25 isolated plaque could be obtained. The cDNA of interest was excised from the phage into 
a phagmid by an in vivo excision procedure usingBM25.8 cells as host (Clontech, Palo 
Alto, CA). Phagmids were amplified, purified, and sequenced using standard procedures. 
The DNA sequence was analyzed for open reading frames and compared to published 
sequences. 

30 One positive plaque isolated from the above procedure was 2.5 kb in length. 

Analysis of its sequence revealed an open-reading frame with a deduced amino acid 
sequence showing high homology to the rat OATl protein (Table 1). This clone was 
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10 



designated human OAT1 (hOATl), or the human homologue of rat OAT1 . The entire 
cDNA and deduced amino acid sequences of human OAT1 are shown in Figure 1. 

The isolated cDNA encoding for hOATl is predicted to encode a polypeptide with 
a length of 550 amino acids. Computer analysis of the deduced amino acid encoded by 
hOATl cDNA reveals consensus protein kinase C (PKC) phosphorylation sites at amino 

acids 271,278, and 284. 

Computer analysis using the GAP- Alignment program from the Wisconsin 
Genetics Software Package indicates that hOATl polypeptide is 88% identical to rat 
OAT1 polypeptide and 39% identical to rat OAT2 polypeptide (Table 1). GAP analysis 
also indicates that hOATl is 86% identical to rat OAT1 nucleotide sequence (Table 1). 

EST accession number R25797 is homologous to hOATl cDNA at nucleotides 
398-2100 (Figure 1). 



15 



20 



25 



Tablel 



Homology Among Human and Rat Organic Anion Transporters 



rat OAT1 
rat OAT2 



hOATl 
hOAT2A 



hOAT2B 



hOAT3 



hOAT4 



hOAT5 



hOATl 
N 1 (A) 2 



86% (88%) 
53% (39%) 



hOAT2A 
N (A) 



53% (37%) 
82% (79%) 



54% (39%) 



hOAT2B 
N (A) 



53% (37%) 
82% (79%) 




55% (39%) 
99% (98%) 



hOAT3 
N (A) 



60% (50%) 
51% (37%) 



61% (51%) 
52% (38%) 



52% (39%) 



hOAT4 
N (A) 



49% (37%) 



44% (33%) 
52% (37%) 



hOATS 
N (A) 



51% (40%) 



50% (40%) 



45% (32%) 



45% (32%) 
52% (39%) 



72% (55%) 



1 

nucleic 



(N) indicates the percentage of identical nucleotides found between compared 
acid sequences as determined using the GAP- Alignment program in the GCG 
SOftWai ^ackage, using a gap weight of 5.0 and a length weigh of 0.3. 

2 (A) indicates the percentage of identical amino acids found between compared 
polypeptides as determined using the GAP-Ahgnment program in the GCG 

S °^adcage t using a gap weight of 3.0 and a length weight of 0.1. 
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Example 3 : Isolation of full-length human OAT2A and OAT2B cDNA 
Human OAT2 cDNA was cloned in the same manner as described for human 
OAT1 cDNA in Example 1 except that EST accession number- A 1016020 was used as a 
probe. In addition, hOAT2A and hOAT2B were isolated from a cDNA library obtained 
5 from human liver rather than kidney, as was hOATl > 

The entire cDNA and deduced amino acid sequences of human OAT2A are shown 
in Figure 2. Human OAT2A (hOAT2A) cDNA is predicted to encode a polypeptide that 
is 546 amino acids in length. Computer analysis of the deduced amino acid sequence 
using the GAP-Alignment program indicates that human OAT2A polypeptide is 79% 
10 identical to rat OAT2 polypeptide (Table 1). GAP analysis also indicates that the 
nucleotide sequence for hOAT2A is 82? o identical to the nucleotide sequence of rat 
OAT2. 

EST accession number A 101 6020 is homologous to hOAT2A cDNA at nucleotide 
sequences 20-1 400 (Figure 2). 
15 The entire cDNA and deduced amino acid sequences of human OAT2B (hOAT2B) 

are shown in Figure 3. Human OAT2B cDNA is predicted to encode a polypeptide that is 
538 amino acids in length. Human OAT2B is identical to hOAT2A except at its C- 
terminus end. Without wishing to be bound by any particular theory, hOAT2B could be 
an mRNA splice variant of hOAT2A. 

20 

Example 4 : Isolation of full-length human OAT3 cDNA 
Human OAT3 (hOAT3) cDNA was isolated as described for hOATl cDNA in 
Example 1 except that EST accession number-H41333 was used as a probe. The entire 
cDNA and deduced amino acid sequences of human OAT3 are shown in Figure 4. 
25 Computer analysis of the hOAT3 cDNA sequence predicts that hOAT3 cDNA encodes a 
polypeptide that is 542 amino acids in length. Computer analysis of the deduced amino 
acid sequence reveals consensus PKC phosphorylation sites at amino acids 259, 266, 269, 
511, and 527. 

EST accession number H41333 is homologous to hOAT3 cDNA at nucleotides 
30 1169-2121 (Figure 4). 
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Example 5 : Isolation of full-length human OAT4 cDNA 
Human OAT4 (hOAT4") cDNA was cloned in the same manner as described for 
human OAT1 cDNA in Example 1 except that EST accession number-AA705512 was 
used as a probe. In addition, hOAT4 cDNA was isolated from a cDNA library obtained 
from human liver rather than kidney, as was hOATl . The entire cDNA and deduced 
amino acid sequences of human OAT4 are shown in Figure 5. Computer analysis of 
human OAT4 cDNA predicts that hOAT4 cDNA encodes a polypeptide that is 554 amino 
acids in length. Computer analysis of the deduced amino acid sequence reveals a 
consensus PKC phosphorylation site at amino acid 324. 

EST accession number AA705512 is homologous to hOAT4 cDNA at nucleotides 

1-1232 (Figure 5). 

Example 6 : Isolation of full-length hOAT5 cDNA 

Human OAT5 (hOAT5) cDNA was cloned in the same manner as described for 
human OAT1 cDNA in Example 1 except that EST accession number-AAA705512 was 
used as a probe. In addition, hOAT5 cDNA was isolated from a cDNA library obtained 
from human liver rather than kidney, as was hOATl. The entire cDNA and deduced 
amino acid sequences of human OATS are shown in Figure 6. Human OAT5 cDNA is 
predicted to encode a polypeptide that is 541 amino acids in length. Computer analysis of 
20 the deduced amino acid sequence reveals consensus PKC phosphorylation sites at amino 

acids 282, 289, 345, and 526. 

EST accession number AA705512 is homologous to hOAT5 cDNA at nucleotides 

1-1193 (Figure 6). 

25 Example 7 : Functional expression of human OAT 1 in HeLa cells 

Human OAT1 cDNA was subcloned into the mammalian expression vector 
pcDNA3.1 (Invitrogen, San Diego, CA), which contains a cytomegalovirus promoter. The 
plasmid construct (pcDNA-hOATl) was then amplified and purified. The construct was 
transfected into HeLa cells (ATCC). cultured in 25 mm 2 flasks, using the Effectene 

30 transfection reagent (Qiagen, Stanford Santa Clarita, CA). Control cell lines were 

transfected with a pcDNA-lacZ construct. One day post-transfection, the cells were split 
into six (6>well plates and cultured for another twenty-four (24) hours. Transfected cells 
were tested for transport of 3 H-/>-aminohippurate, a classical substrate of the kidney 
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organic anion transporter. Since OAT1 is an antiporter, the transfected cells were first 
preloaded with 1 mM glutarate for one (0 hour. The glutarate-containing media was 
removed and replaced with media containing 50 \iM PAH at 1 nCi/ml. After ten (10) 
minutes at 37°C, the media was aspirated and the cells were washed three times with ice 
5 cold phosphate-buffered saline (PBS). The cells were then solubilized with 0.5% SDS. 
The amount of labeled PAH transported into the cells was determined by a scintillation 
counter. As shown in Fig. 7, HeLa cells transfected with hOATl were able to uptake PAH 
whereas HeLa cells transfected with a vector control were only able to uptake a very small 
amount of PAH (almost 10-fold less than cells transfected with a vector containing DNA 
1 0 encoding hO AT 1 ). 

Example 8 : Functional expression of hOATl in Xenopus oocytes 
Complementary hOATl RNA was synthesized in vitro from the pcDNA-hOATl 
plasrnid using T7 RNA polymerase (Ambion, Austin, TX). The transcription reaction was 

15 performed at 37°C for 2 hours. The RNA was purified by phenol and chloroform 
extraction and precipitated with ammonium acetate and ethanol. Purified RNA was 
analyzed for size and purity by agarose gel electrophoresis. RNA was resuspended in 
water at a concentration of 0.3 \igf\i\. Oocytes were prepared from ovaries removed from 
female Xenopus laevis according to White et al., (1985) PNAS 82:4852-56. About 1 5 ng 

20 of hOATl cRNA in 50 nl was injected into each oocyte. As a control, 50 nl of water was 
injected into each oocyte. The injected oocytes were assayed for transport of PAH two (2) 
days post-injection. Oocytes were preloaded with 1 mM glutarate for 2 hours in Barth 
buffer. Transport assays were conducted in modified Barth buffer (100 mM NaCl, 2mM 
KC1, 1 mM CaCl 2 , 1 mM MgCl 2 , 10 mM HEPES, pH 7.5) with 50 yM PAH at 4 jiCi/ml. 

25 After a one (1) hour incubation at 25°C, oocytes were washed with 3 ml ice cold transport 
assay buffer three (3) times. Each oocyte was solubilized in 0.5 ml 0.5% SDS and 
subjected to scintillation counting. As shown in Fig. 8, oocytes injected with the hOATl 
RNA uptake 3 H-PAH to a much greater extent than water injected oocytes. 

30 Example 9 : Identification of fluorescein as a substrate for hOATl 

Human embryonic kidney (HEK293) cells were transfected with pcDNA-hOATl, 
as in Example 7. One day after transfection the cells were assayed for transport of 
fluorescein or SITS. Ten (1 0) nM of either substrate was added to the external media for 
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ten (10) minutes at 37°C. The cells were washed with ice-cold PBS to remove excess 
label and analyzed under a fluorescent microscope. Cells that take up either fluorescein or 
SITS are predicted to turn fluorescent green. Fluorescent microscopy revealed that 
hOATl -expressing cells took up fluorescein but not SITS. 

5 

Example 10 : Competition assay to identify substrates of hOATl using 
fluorescein 

HEK293 cells were transiently transfected with pcDNA3-hOATl as in Example 7 
and assayed for transport of fluorescein, but in the presence or absence of 0.5 mM PAH. 
1 0 Since fluorescein and PAH are both substrates of hO AT 1 , they will compete for access to 
the transporter. As evidenced by fluorescent microscopy, hO AT 1 -expressing cells showed 
uptake of fluorescein in the absence but not in the presence of external PAH. 

Example 1 1 : Establishment of Cell lines stably expressing hOATs. 

15 Vector pIND(spl) (Invitrogen) contains the ecdysone-inducible promoter (EcRE). 

Using standard methods known to those in the art, cDNAs encoding hOATl, hOAT2A, 
into vector pIND/spl downstream of its EcRE promoter. After cloning, the vectors were 
separately transfected into cell line EcR293 (Invitrogen). The EcR293 cell line is a 
derivative of cell line HEK 293 and contains a stably integrated pVgRXR vector. The 

20 pVgRXR vector constitutively expresses the ecdysone receptor and the retinoid X receptor 
(RXR). The ecdysone receptor and RXR form a heterodimer which, upon activation by 
the hormone ecdysone, binds to a promoter containing the EcRE. The pVgRXR vector 
was maintained within the EcR293 cell line by selection with 400 ng/ml zeocin. The 
pIND(spl)-hOATplasmids contain a neomycin resistance marker, and cells stably 

25 integrating the plasmids were selected for by culturing the cells in media supplemented 
with 400 ng/ml G418. After two weeks of selection, individual colonies appeared and 
were isolated and expanded. To test for expression of the transporter, the cells were 
induced with 1 .25 ng/ml ponasterone, an ecdysone analogue, for twenty-four (24) hours. 
The cells were then subjected to standard transport assays. 

30 EcR293 cells with stably integrated pIND(spl)-hOATl expressed hOATl at a 

basal level that .was not harmful to the cells. These cells grow at a rapid rate comparable 
to the parental EcR293 cells. Hence, the hOATl cDNA can be maintained within these 
cells indefinitely. Expression of hOATl at high levels can be easily achieved by inducing 
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with ponasterone for twenty-four (24) hours. As shown in Fig. 9, ECR293 cells 
containing hOATl in plasmid pIND(sp-l) uptake approximately 50- fold more PAH when 
induced by ponasterone than non-induced cells containing pIND(sp-l)-hOATl . ECR293 
cells stably containing hOAT2A, in plasmid plND(sp-l), uptake approximately 2.6-fold 
5 more 3 H-PAH when induced by ponasterone than non-induced cells (Fig. 10). Cell lines 
stably expressing hOAT2B, hOAT3, hOAT4, and hOAT5 can be established using the 
same method as for hOATl and hOAT2A. 

Example 12 : Competition assay to identify new substrates of hOAT2, using a 

10 radiolabeled substrate 

EcR293/OAT2A cells were assayed for transport of 3 H-PAH as in Example 7. The 
uptake of 3 H-PAH was blocked by 100 pM sulfobromophthalein (BSP) but not 500 pM a- 
ketoglutarate (AKG) (Fig. 1 1). The competition assay thus identified BSP and not AKG 
as a preferred substrate of hOAT2A. 

15 

Example 13 : Predicting pharmacokinetics in man using a panel of hOAT 
expressing cells 

A panel of different cell lines, each expressing a particular human OAT protein, 
can be used to predict pharmacokinetics for anionic drugs. For instance, PAH is a 

20 compound transported rapidly by hOATl and to a much lesser extent by the other human 
OATs disclosed in the invention. The panel of hOAT expressing cells would predict that 
PAH is eliminated rapidly by the human kidney, as previously described. Similarly, the 
competition assays in Examples 10 and 12 showed that fluorescein is a very good 
substrate of human OAT1 and that BSP is a good substrate of human OAT2A. Therefore, 

25 the assay would predict that fluorescein is eliminated from the human body mainly via 
kidney excretion and that BSP is mainly eliminated from the human body via liver 
excretion. 

Further confirmation that a compound is a substrate of an hOAT would be done 
using in vivo tissue distribution assays. Compounds found to be transported by hOATs, 
30 using the cell line described above, would be injected into animals and followed to 

confirm their transport by the appropriate hOAT. For instance, a compound found to be 
transported by a cell line stably expressing hOATl, would be expected to be excreted in 
the urine of an animal injected with the said compound. Similarly, a compound found to 
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be transported by a cell line expressing hOAT2, would be expected to be transported into 
the liver of animals injected with said compound. 

Since hOAT4 and 5 have homology to OCTs, their range of of substrates may 
include organic cations. Therefore, these transporters may be used to predict the 
5 pharmacokinetics of certain cationic drugs as well, using the methods described herein. 

Example 14 : Identification of hO AT substrates using antiporter activity. 

EcR293/hOATl cells are incubated with 3 H-ot-ketoglutarate for an hour to preload 
with labeled a-ketoglutarate. A test substrate is added to the cells. After a 10 minute 
10 incubation, an aliquot of the supernatant is removed and measured for effluxed <x- 

ketoglutarate by scintillation counting. Substrates of hOATl will show trans-stimulation 
of a-ketoglutarate exchange. Inhibitors or non-substrates of hOATl will not stimulate a- 
ketoglutarate efflux. 

1 5 Example 15 : Cytoxicity assays using cell lines stably expressing hOAT 

polypeptides 

EcR293 cells stably transformed with hOAT polypeptides can be used to screen for 
compounds potentially toxic to certain organs or tissues through use of a cytoxicity assay. 
Test compounds can be added to cell lines stably expressing hOAT polypeptides. 
20 Compounds transported by hOAT polypeptides and which are toxic will cause cell death. 
Cell death can be measured or determined using techniques well known in the art, such as 
visual inspection, microscopic inspection, and dye exclusion. Using dye exclusion, 
healthy cells are able to exclude a particular dye whereas injured or dead cells are unable 
to exclude the dye. 

25 The foregoing examples are not limiting and are merely representative of various 

aspects and features of the present invention. All references referred to above are 
incorporated herein by reference. 

Example 16 : Functional expression of hOAT3 in Xenopus oocytes 
Complementary hOAT3 RNA was synthesized in vitro from the pcDNA-hOAT3 
30 plasmid using T7 RNA polymerase (Ambion, Austin, TX). The transcription reaction was 
performed at 37°C for 2 hours. The RNA was purified by phenol and chloroform 
extraction and precipitated with ammonium acetate and ethanol. Purified RNA was 
analyzed for size and purity by agarose gel electrophoresis. RNA was resuspended in 
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water at a concentration of 0.5 Oocytes were prepared from ovaries removed from 

female A 'enopus laevis according to White et aL (19851 PNAS 82:4852-56. About 25 ng 
of hOAT3 cRNA in 50 nl was injected into each oocyte. As a control, 50 nl of water was 
injected into each oocyte. The injected oocytes were assayed for transport of PAH two (2) 
5 days post-injection. Transport assays were conducted in modified Barth buffer (1 00 mM 
NaCK 2mM KCL 1 mM CaCb, 1 mM MgCl 2 , 10 mM HEPES, pH 7.5) with 50 \xM PAH 
at 10 jiCi/ml. After a one (1) hour incubation at 25°C, oocytes were washed with 3 ml ice 
cold transport assay buffer three (3) times. Each oocyte was solubilized in 0.5 ml 0.5% 
SDS and subjected to scintillation counting. As shown in Fig. 12 t oocytes injected with 
10 the hOAT3 RNA uptake 3 H-PAH to a much greater extent than water injected oocytes. 

Example 17 : Determination of tissue distribution of hOATs 
The individual hOAT cDNAs were labeled with P-ATP to a specific activity of > 
1 x 10 9 dpm/fig using the Strip-EZ kit™ (Ambion). The Strip-EZ kit™ allows easy 

15 removal of probes from a blot and the blot can thus be reprobed several times without 
substantial loss of signal. The Human Multiple Tissue Northern blot was obtained from 
Clontech, which contained mRNA extracted from 12 different human tissues. The blot 
was successively hybridized with each hOAT probe in 5 ml of ExpressedHyb buffer 
(Clontech) at a final concentration of 3-5 x 10 6 dpm/ml. Hybridization was carried out at 

20 65°C for 4 hrs. The blot was washed two times with 2X SSC and 0. 1% SDS at 65°C for 
15 min each, and then once with 0.1X SSC and 0.1% SDS at 65°C for 1 hr. The blot was 
then exposed to BioMax MR autographic film (Kodak) with intensifying screen at -80°C 
for 24 hrs. 

The northern analysis shows that hOATl was expressed only in human 
25 kidney (FIG 13). The OAT1 probe hybridized to a major band at about 2.4 kb. 

The hOAT2, probe hybridized to two major bands at about 2.4 and about 3 kb from 
liver and kidney, with higher levels in the liver than kidney (FIG 13). HOAT3 was 
expressed only in the kidney, with a major transcript of about 2.4 kb and minor 
transcript at about 4.3 kb (FIG 13). HOAT4 and hOAT5 were expressed 
30 exclusively in the liver (FIG 13). hOAT4 probe hybridized to a major band at 

about 4.3 kb and hOAT5 probe hybridized to a major band at about 2.5 kb. Hence, 
northern analyses indicated that hOATl and 3 are mainly kidney transporters and 
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hOAT4 and 5 are liver transporters, whereas hOAT2 is a liver and kidney 
transporter. 

All patents and publications mentioned in the specification are indicative of the 
levels of skill of those skilled in the art to which the invention pertains. All references 
5 cited in this disclosure are incorporated by reference to the same extent as if each 
reference had been incorporated by reference in its entirety individually. None of the 
references are admitted to be prior art. 

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
10 those inherent therein. The methods and compositions described herein as presently 
representative of preferred embodiments are exemplary and are not intended as limitations 
on the scope of the invention. Changes therein and other uses will occur to those skilled in 
the art, which are encompassed within the spirit of the invention, are defined by the scope 
of the claims. 

15 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. Thus, such additional embodiments are within the scope 
of the present invention and the following claims. 

The invention illustratively described herein suitably may be practiced in the absence 

20 of any element or elements, limitation or limitations which is not specifically disclosed 
herein. Thus, for example, in each instance herein any of the terms "comprising", 
"consisting essentially of and "consisting of* may be replaced with either of the other 
two terms. The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention that in the use of such terms and 

25 expressions of excluding any equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that although the present invention has 
been specifically disclosed by preferred embodiments, optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the art, 

30 and that such modifications and variations are considered to be within the scope of this 
invention as defined by the description and the appended claims. 

In addition, where features or aspects of the invention are described in 
terms of Markush groups or other grouping of alternatives, those skilled in the art 
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will recognize that the invention is also thereby described in terms of any 
individual member or subgroup of members of the Markush group or other group. 

Other embodiments are within the following claims. 
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What is claimed is: 

1 . An isolated, enriched, or purified nucleic acid molecule comprising a 
nucleic acid molecule selected from the group consisting of SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

2. An isolated, enriched, or purified nucleic acid molecule comprising a 
nucleic acid molecule which encodes a human OAT polypeptide selected from the group 
consisting of SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, and SEQ ID 
NO:12. 

3. An isolated, enriched, or purified nucleic acid molecule which is 
complementary to any of the nucleic acid molecules of claims 1 and 2. 



4. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
1 5 wherein said nucleic acid molecule is SEQ ID NO:2. 

5. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is SEQ ID NO:3. 

20 6. The isolated, enriched, or purified nucleic acid molecule of claim 1, 

wherein said nucleic acid molecule is SEQ ID NO:4. 

7. The isolated, enriched, or purified nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is SEQ ID NO:5. 



8, The isolated, enriched, or purified nucleic acid molecule of claim 1 , 
wherein said nucleic acid molecule is SEQ ID NO:6. 



9. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
30 wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:8. 

10. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:9. 

1 1 . The isolated, enriched, or purified nucleic acid molecule of claim 2, 
35 wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:10. 
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12. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
wherein said nucleic acid molecule encodes a polypeptide that is SEQ ID NO:l 1. 

13. The isolated, enriched, or purified nucleic acid molecule of claim 2, 
5 wherein said nucliec acid molecule encodes a polypeptide that is SEQ ID NO: 12. 

14. An isolated nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of: 

(a) a nucleotide sequence which encodes a naturally occurring variant 
10 of a polypeptide comprising the amino acid sequence of SEQ ID NO:8, wherein the 

nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:2 under 

stringent conditions; 

(b) a nucleotide sequence which encodes a naturally occurring variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO:9, wherein the 

15 nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:3 under 
stringent conditions; 

(c) a nucleotide sequence which encodes a naturally occurring variant 
of a polypeptide comprising the amino acid sequence of SEQ ID NO: 10, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:4 under 

20 stringent conditions; 

(d) a nucleotide sequence which encodes a naturally occurring variant 
of a polypeptide comprising the amino acid sequence of SEQ ED NO:l 1, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:5 under 
stringent conditions; and 

25 (e) a nucleotide sequence which encodes a naturally occurring variant 

of a polypeptide comprising the amino acid sequence of SEQ ID NO: 12, wherein the 
nucleotide sequence hybridizes to a nucleic acid molecule comprising SEQ ID NO:6 under 
stringent conditions. 
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15. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 83% identity to nucleotide sequence contained in SEQ ID NO:2, wherein the 
percent identity is calculated using the GAP-Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

1 6. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 83% identity to nucleotide sequence contained in SEQ ID NO:3, wherein the 
percent identity is calculated using the GAP- Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

17. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 61% identity to nucleotide sequence contained in SEQ ID NO:4, wherein the 
percent identity is calculated using the GAP-Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

18. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 60% identity to nucleotide sequence contained in SEQ ID NO:5, wherein the 
percent identity is calculated using the GAP-Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

19. An isolated nucleic acid molecule comprising a nucleotide sequence that 
has at least 60%' identity to nucleotide sequence contained in SEQ ID NO:6, wherein the 
percent identity is calculated using the GAP-Alignment program in the GCG software 
package, using a gap weight of 5.0 and a length weight of 0.3. 

20. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 80% percent identical to the amino acid sequence 
of SEQ ID NO: 8 or 9, wherein said percent identity is calculated using the GAP program 
in the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 



30 
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21. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO: 10, wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

5 

22. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO:l 1, wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.1. 

10 

23. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide which is at least 70% percent identical to the amino acid sequence 
of SEQ ID NO: 12, wherein said percent identity is calculated using the GAP program in 
the GCG software package, using a gap weight of 3.0 and a length weight of 0.L 

15 

24. The nucleic acid molecule of claim 1 or 2, further comprising a vector or 
promoter effective to initiate transcription in a host cell. 

25. The nucleic acid molecule of claim 24, wherein said promoter comprises an 
20 inducible promoter. 

26. A nucleic acid probe for the detection, isolation, purification, enrichment, 
or amplification of a nucleic acid molecule encoding human OAT in a sample, wherein 
said nucleic acid probes are selected from the group of nucleic acid sequences set forth in, 

25 SEQ ED NO:2 from nucleotide 1370- 1638, SEQ ID NO: 3 from nucleotide 1370-1614, 
SEQ ID NO:4: from nucleotide 1-1 107, SEQ ID NO: 5 from nucleotide 995-1662 and 
SEQ ID NO:6 from nucleotide 993-1623. 

27. A nucleic acid probe for the detection, isolation, purification, enrichment, 
30 or amplification of a nucleic acid molecule encoding human OAT in a sample, wherein 

said nucleic acid probes are selected from the group of nucleic acid sequences encoding 
the polypeptide fragment selected from the group consisting of amino acids 132-136 of 
SEQ ID NO:8, amino acids 132-136 of SEQ ID NO:9, amino acids 1 12-1 16 of SEQ ID 
NO: 10, amino acids 136-140 of SEQ ID NO:l 1 and amino acids 135-139 of SEQ ID 
35 NO: 12. 
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28. A recombinant cell comprising a nucleic acid molecule encoding a human 
OAT polypeptide wherein said nucleic acid molecule is selected from the group consisting 
ofSEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

29. The cell of claim 28, wherein said cell is stably transformed. 

30. The cell of claim 28 or 29, wherein said cell is a mammalian cell. 

3 1 . The cell of claim 28 or 29, wherein said polypeptide is a fragment of the 
polypeptide encoded by the amino acid sequence encoded by the nucleic acid sequence set 
forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or the 
amino acid sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID 
NO: 1 1 , or SEQ ID NO: 12. 

32. The cell of claim 28 or 29, wherein said nucleic acid is expressed from an 
inducible promoter. 

33. The cell of claim 32, wherein said promoter is inducible with ecdysone. 

34. An isolated, enriched, or purified human OAT polypeptide encoded by the 
nucleic acid sequence set forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, or SEQ ID NO:6, or the amino acid sequence set forth in SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO:10, SEQ ID NO:l 1, or SEQ ID NO:12. 

35. The polypeptide of claim 34, wherein said polypeptide is a fragment of the 
polypeptide encoded by the nucleic acid sequence set forth in SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or the amino acid sequence set 
forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, or SEQ ID NO: 12. 



36. The polypeptide of claim 34, wherein said polypeptide is isolated, purified, 
or enriched from a cell that comprises an endogenous nucleic acid molecule that encodes 
said polypeptide. 
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37. The polypeptide of claim 34, wherein said polypeptide is isolated, purified, 
or enriched from a cell that is transformed with a nucleic acid molecule that encodes said 
polypeptide. 

5 38. The polypeptide of claim 34, wherein said polypeptide is chemically 

synthesized. 

39. A labeled or unlabeled antibody or antibody fragment having specific- 
binding affinity to a human OAT polypeptide encoded by the nucleic acid sequence set 

10 forth in SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NO:6, or 
the amino acid sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ 
ID NO:ll, or SEQ ID NO: 12. 

40. A hybridoma which produces an antibody having specific binding affinity 
15 to a human OAT polypeptide encoded by the nucleic acid sequence set forth in SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, or SEQ ID NCk6, or the amino acid 
sequence set forth in SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1, or 
SEQ ID NO: 12. 

20 4 L A method for identifying modulators of human OAT2A, human OAT2B, 

human OAT3, human OAT4, or human OATS polypeptide, said method comprising: 

(a) contacting said hOAT polypeptide with a test substance; 

(b) adding a substrate, 

(c) measuring the activity of said hOAT polypeptide; and 

25 (d) if said substance decreases transport, determining whether said test 

substance is a modulator or competitive inhibitor of said hO AT polypeptide. 

42. A method for identifying a modulator of human OAT2A, human OAT2B, 
human OAT3, human OAT4, or human OATS polypeptide in a cell, said method 
30 comprising: 

(a) expressing an hOAT polypeptide in a cell; 

(b) contacting said cell with a test substance; 

(c) adding a substrate; 

(d) measuring the activity of said hOAT polypeptide; and 
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(e) if said substance decreases transport, determining whether said test 
substance is a modulator or competitive inhibitor of said hOAT. 

43. A method to screen for substrates or inhibitors of human OAT2A, human 
OAT2B, human OAT3, human OAT4, and human OATS using a cell stably transformed 
with an hOAT nucleic acid molecule and a competition assay, said method comprising: 

(a) adding a test substrate to a cell line expressing said human OAT 

polypeptides; 

(b) adding a substrate known to be transported by said human OAT 

polypeptides in said cell line; 

(c) measuring the ability of said test substrate to compete with the 
uptake of said substrate known to be transported by human OAT1, human OAT2A, human 
OAT2B, human OAT3, human OAT4, and human OATS; and 

(d) confirming whether said test substrate is an inhibitor or substrate, 

44. The method of claim 43, wherein said test substrate and/or known substrate 
are fluorescent or conjugated with a fluorescent labeled. 

45. The method of claim 44, wherein said assay is evaluated using high- 
20 throughput screening. 

46. A method for identifying substrates of human OAT polypeptides, said 

method comprising: 

(a) loading a cell line expressing an hOAT polypeptide with a 
25 substance know to be effluxed from the interior of said cell as part of the antiporter 

activity of said hOAT polypeptide 

(b) adding a test substrate to said cell line; 

(c) measuring the efflux of said antiporter substance; and 

{d) determining if said test substrate affected the efflux of said 
30 antiporter substance. 

47. The cell of claim 28 or 29, wherein said cell is a yeast cell. 

48. The cell of claim 28 or 29, wherein said cell is an insect cell. 
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49. A method of preparing an hOAT polypeptide comprising: 

(a) culturing the recombinant cell of claim 29 under conditions that 
permit expression of the hOAT polypeptide; and 

lb) isolating said polypeptide. 



50. An amphibian oocyte containing a nucleic acid molecule encoding for a 
human OAT polypeptide according to claim 2. 

5 1 . The amphibian oocyte of claim 50 that is a Xenopus laevis oocyte. 

52. The nucleic acid molecules of claim 1 or 2, wherein said nucleic acid 
molecules comprise one or more regions that encode an hOAT polypeptide or an hOAT 
polypeptide fragment where the hOAT polypeptide or the hOAT polypeptide fragment is 
fused to a non-hOAT polypeptide or epitope tag. 

53. The nucleic acid molecule of claim 52, wherein said non-hOAT 
polypeptide or epitope tag is selected from the group consisting of glutathione-S- 
transferase, green- fluorescent protein and an amino terminal tag composed of histidine 
residues. 



54. A method to identify toxic compounds transported by hOAT polypeptides 
comprising: 

(a) expressing an hOAT polypeptide in cells; 

(b) adding a test substrate to said cells; and 
25 (c) determining if said cells are viable. 
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CTGAGCTGACCTGACCCCCAAAGTGAAGGAGAACCTGCAAGGGAAAACCCAGGGACAGAT 
1 +- + f +- f + 60 

GACTCGACTGGACTGGGGGTTTCACTTCCTCTTCGACGTTCCCTTTTCCCTCCCTGTCTA 

CAGGGAGACCGGGGAAGAAGGAGGAGCAGCCAACGAGGCTGCTGTCCCCCCACAGAGCAG 
6| + + 4— 4- 4- + 120 

GTCCH'CTGGCCCCTTCTTCCTCCTCGTCGGTTCCTCCGACGACAGGCGGGTGTCTCGTC 

CTCGGACTCAGCTCCCGGAGCAACCCAGCTGCGGAGGCAACGGCAGTGCTGCTCCTCCAG 
121 + + + + + + 180 

GAGCCTGAGTCGAGGCCCTCGTTGGGTCGACGCCTCCGTTGCCGTCACCACGAGGAGGTC 
CGAAGGACAGCAGGCAGGCAGACAGACAGAGGTCCTGGGACTGGAAGGCCTCAGCCCCCA 

181 + 4- 4- +— 4- -f 240 

CCTTCCTGTCGTCCGTCCGTCTGTCTGTCTCCAGGACCCTGACCTTCCGGAGTCGGGGGT 

GCCACTGCGCTGGGCCTCGCCCAATGGCCTTTAATGACCTCCTGCAGCAGCTGGGGGGTG 
241 + 4— -+ +- 4- 4- 300 

CGGTGACCCGACCCCGACCGGGTTACCGGAAATTACTGGAGGACGTCGTCCACCCCCCAC 

MAFNDLLQQVGGV- 

TCGCCCGCTTCCAGCAGATCCAGGTCACCCTGGTGGTCCTCCCCCTGCTCCTGATGGCTT 
301 4- — -+ + 4- -+~- 4- 360 

AGCCGGCGAAGGTCGTCTAGGTCCAGTGGGACCACCAGGAGGGGGACGAGGACTACCGAA 

GR FQQIQ VTLVVLPLLLMAS- 
CrCACAACACCCTGCAGAACrrCACTGCTGCCATCCCTACCCACCACTCCCGCCCGCCTG 

361 4- 4- 4-—- 4- 4- 4- 420 

GAGTGTTGTGGGACGTCTTGAAGTGACGACGGTAGGGATGGGTGGTGACCGCGGGCGGAC 

HNTLQNFTAAIPTHHCRPPA- 

CCGATGCCAACCTCAGCAAGAACGGGGGGCTGGAGCTCTGGCTGCCCCGGGACAGGCAGG 
421 4- + 4- —4- ¥ 480 

GGCTACGGTTGGAGTCCTTCTTGCCCCCCGACCTCCAGACCGACCGGGCCCTGTCCGTCC 
DANLSKNGGLEVWLPRDRQG- 

GGCAGCCTGAGTCCTGCCTCCGCTTCACCTCCCCGCAGTGGGGACTGCCCTTl'CTCAATG 
481 ~ -4- + + 4- 4- + 540 

CCGTCGGACTCAGGACCGAGGCCAAGTGGACGGCCGTCACCCCTGACGCCAAAGAGTTAC 

QPESCLRFTSPQWCLPFLNG- 

GCACAGAAGCCAATGGCACACGGCCCACAGAGCCCTGCACCGATGGCTGGATCTATGACA 
541 +~ 4- — 4- 4- + 600 

CGTGTCTTCGGTTACCGTGTCCCCGGl'GTCTCCCCACGTGGCTACCGACCTAGATACTCT 
TEANGTGATEPCTDGWIYDN- 
ACAGCACCTTCCCATCTACCATCGTGALTCAGTGGGACCTTGTGTGCTCTCACAGGGCCC 

601 4- 4- + 4- 4- -4- 660 



FIG. 1A 



SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO 01 04283 A3_IA> 



WO 01/04283 



PCT/US00/18980 



2/31 



TGTCGTGGAAGGGTAGATGGTAGCACTGACTCACCCTGGAACACACGAGAGTGTCCCGGG 
STF PSTIVTEWDLVCSHRAL- 

TACGCCAGCTGGCCCAGTCCTTGTACATGGTGGGGGTGCTGCTCGGAGCCATGGTGTTCG 

661 4--- + + + + -+ 720 

ATGCGGTCGACCGGGTCAGGAACATGTACCACCCCCACGACGAGCCTCGGTACCACAAGC 

RQLAQSLYMVGVLLGAMVFG- 

GCTACCTTGCAGACAGGCTAGGCCGCCGGAAGGTACTCATCTTGAACTACCTGCAGACAG 

721 + + +- + +-- --+ 780 

CGATGGAACGTCTGTCCGATCCGGCGGCCTTCCATGAGTAGMCTTGATGGACGTCTGTC 

YLADRLGRRKVLILNYLQTA- 

CTGTGTCAGGGACCTGCGCAGCCTTCGCACCCAACTTCCCCATCTACTGCGCCTTCCGGC 

781 + + + -+ --+ + 840 

GACACAGTCCCTGGACGCGTCGGAAGCGTGGGTTGAAGGGGTAGATGACGCGGAAGGCCG 

V SGTCAAFA PNFP I YCAFRL- 

TCCTCTCGGGCATGGCTCTGGCTGGCATCTCCCTCAACTGCATGACACTGAATGTGGAGT 

841 + ---+ + +-- + + 900 

AGGAGAGCCCGTACCGAGACCGACCGTAGAGGGAGTTGACGTACTGTGACTTACACCTCA 

LSGMALAGI SLNCMTL NVEW- 

GGATGCCCATTCACACACGGGCCTGCGTGGGCACCTTGATTGGCTATGTCTACAGCCTGG 

901 + + + + + + 960 

CCTACGGGTAAGTGTGTGCCCGGACGCACCCGTGGAACTAACCGATACAGATGTCGGACC 

M P I H TRACVGTL IGY VYSLG- 

GCCAGTTCCTCCTGGCTGGTGTGGCCTACGCTGTGCCCCACTGGCGCCACCTGCAGCTAC 

961 ---+ + + +-- --+ + 1020 

CGGTCAAGGAGGACCGACCACACCGGATGCGACACGGGGTGACCGCGGTGGACGTCGATG 

QFLLAGVAYAVPHWRHLQLL- 

TGGTCTCTGCGCC 1 1 1 1 1 1 1 GCCTTCTTCATCTACTCCTGGTTCTTCA1TGAGTCGGCCC 

1021 + + + + + + 1080 

ACCAGAGACGCGGAAAAAAACGGAAGAAGTAGATGAGGACCAAGAAGTAACTCAGCCGGG 

VSAP FFAFF IYSWFF I ESAR- 

GCTGGCACrCCTCCTCCGGGAGGCTGGACCTCACCCTGAGGGCCCTGCAGAGAGTCGCCC 

1081 + + + + -+ + 1140 

CGACCGTGAGGAGGAGGCCCTCCGACCTGGAGTGGGACTCCCGGGACGTCTCTCAGCGGG 

WHSSSGRLDLTLRALQRVAR- 

GGATCAATGGGAAGCGGGAAGAAGGAGCCAAATTGAGTATGGAGGTACTCCGGGCCAGTC 
1141 + + + + + + 1200 
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CCTAGTTACCCTTCGCCCTTCTTCCTCGGTTrAACTCATACCTCCATGAGGCCCGGTCAG 
I NGKREEGAKLSMEVLRASL- 

TGCAGAAGGAGCTGACCATGGGCAAAGGCCAGGCArCGGCCATGGAGCTGCTGCGCTGCC 

1201 -- + + — + — + + + 1260 

ACGTCTTCCTCGACTGGTACCCGTTTCCGGTCCGTAGCCGGTACCTCGACGACGCGACGG 

QKELTMGKGQASAMELLRCP- 

CCACCCTCCGCCACCTCTTCCTCTGCCTCTCCATGCTGTGGTrTGCCACTAGCTTTGCAT 

1261 -- + + + — -+ — + -+ 1320 

GGTGGGAGGCGGTGGAGAAGGAGACGGAGAGGTACGACACCAAACGGTGATCGAAACGTA 

TLRHLFICLSMLWFATSFAY- 

ACTATGGGCTGGTCATGGACCTGCAGGGCTTTGGAGTCAGCATCTACCTAATCCAGGTGA 

1221 + + + + — + + 1380 

TGATACCCGACCAGTACCTGGACGTCCCGAAACCTCAGTCGTAGATGGATTAGGTCCACT 

YGL VMDLQGFGVSIY LIQV I - 

TCTTTGGTGCTGTGGACCTGCCTGCCAAGCTTGTGGGCTTCCTTGTCATCAACTCCCTGG 

1381 + + ---+ + — + --+ 1440 

AGAAACCACGACACCTGGACGGACGGTTCGAACACCCGAAGGAACAGTAGTTGAGGGACC 

FGAVD'LPAKLVGFLV I NSLG- 

GTCGCCGGCCTGCCCAGATGGCTGCACTGCTGCTGGCAGGCATCTGCATCCTGCTCAATG 

1441 + + + + + + 1500 

CAGCGGCCGGACGGGTCTACCGACGTGACGACGACCGTCCGTAGACGTAGGACGAGTTAC 

R RPAQMAALLLAG I C I LL N G - 

GGGTGATACCCCAGGACCAGTCCATTGTCCGAACCTCTCTTGCTGTGCTGGGGAAGGGTT 

1501 + + + — -+ + + 1560 

CCCACTATGGGGTCCTGGTCAGGTAACAGGCTTGGAGAGAACGACACGACCCCTTCCCAA 

V I PQDQS I VRTSLAVLGKGC- 

GTCrGGCTGCCTCCTTCAACTGCATCTTCCTGTATACTGGGGAACTGTATCCCACAATGA 

1561 + + + + + + 162 0 

CAGACCGACGGAGGAAGTTGACGTAGAAGGACATATGACCCCTTGACATAGGGTGTTACT 

LAASFNC I FLYTGELYP TMI- 

TCCGGCAGACAGGCATGGGAATGGGCAGCACCATGGCCCGAGTGGGCAGCATCGTGAGCC 

1521 + + --+ +- — + 1680 

AGGCCGTCTGTCCGTACCCTTACCCGTCGTGGTACCGGGCTCACCCGTCGTAGCACTCGG 

RQTGMGMGSTMARVGS I V. S P - 

CACTGGTGAGCATGACTGCCGAGCTCTACCCCTCCATGCCTCTCTTCATCTACGGTGCTG 
1681 + + + + + + 1740 
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GT6ACCACTCGTACTGACGGCTCGA6ATGGGGAGGTACGGAGAGAAGTAGATGCCACGAC 
LVSMTAELYPSMPLFI Y G A V - 

TTCCTGTGGCCGCCAGCGCTGTCACTGTCCTCCTGCCAGAGACCCTGGGCCAGCCACTGC 

1741 + + + + + + 1800 

AAGGACACCGGCGGTCGCGACAGTGACAGGAGGACGGTCTCTGGGACCCGGTCGGTGACG 

PVAASAVTV LLPETLGQPL P - 

CAGACACGGTGCAGGACCTGGAGAGCAGGAAAGGGAAACAGACGCGACAGCAACAAGAGC 

1801 + + + +- + + I860 

GTCTGTGCCACGTCCTGGACCTCTCGTCCTTTCCCTTTGTCTGCGCTGTCGTTGTTCTCG 

DTVQDL ESRK.GKQTRQQQEH- 

ACCAGAAGTATATGGTCCCACTGCAGGCCTCAGCACAAGAGAAGAATGGACTCTGAGGAC 

1861 + + + + + + 1920 

TGGTCTTCATATACCAGGGTGACGTCCGGAGTCGTGTTCTCTTCTTACCTGAGACTCCTG 

QKYMVPLQASAQEKNGL* 

TGAGAAGGGGCCTTACAGAACCCTAAAGGGAGGGAAGGTCCTACAGGTCTCCGGCCACCC 

1921 + + + + + + 1980 

ACTCTTCCCCGGAATGTCTTGGGATTTCCCTCCCTTCCAGGATGTCCAGAGGCCGGTGGG 



ACACAAGGAGGAGGAAGAGGAAATGGTGACCCAAGTGTGGGGGTTGTGGTTCAGGAAAGC 

1981 +--- + + + -+ + 2040 

TGTGTTCCTCCTCCTTCTCCTTTACCACTGGGTTCACACCCCCAACACCAAGTCCTTTCG 



ATCTTCCCAGGGGTCCACCTCCCTTTATAAACCCCACCAGAACCACATCATTAAAAGGTT 

2041 + + + + + + 2100 

TAGMGGGTCCCCA6GTGGAGGGMATATTTGGGGTGGTCTTGGTGTAGTAA7TTTCCAA 



TGACTGCGCACCAAAAAAAAAAAAAAA 

2101 + + 2100 

ACTGACGCGTGG 1 1 1 1 1 ITTTTTTTTT 



FIG. ID 
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CTGCACCTGAAGCATTTGGTGGGT6AGCAGCATGGGCTTTGAGGAGCTGCTG6AGCAGGT 

2 - + + +-- --+ +-- -+ 60 

GACGrGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV- 

GGGCGGCTTTGGGCCCTTCCAACTGCGGAATGTGGCACTGCTGGCCCTGCCCCGAGTGCT 

51 + + + + +- + 120 

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGFGPFQLRNVALLALPRVL - 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCGTGCCTGCCCACCGATGTGC 

121 + + + + + ----+ 180 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGCACGGACGGGTGGCTACACG 

LPLHFLLPI FLAAVPAHRCA- 

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 

181 +-- + + --+-- +--- + 240 

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

LPGAPANFSHQDVWL E ( A H L P - 

CCGGGAGCCTGATGGCACGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 

241 + + + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 

REPDGTLSSCLRFAYPQALP - 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTGCCAC 

301 + + ---+ + + + 360 

GTTGTGGTGCAACCCCCTTCTTTCCGTCTCGGCACCCCTCGACCTCCTACTTGGACGGTG 

NTTLGEE RQSRGELEDEPAT- 

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC 

361 -+ + + + + + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSQGWEYDHSEFSSTIAT- 

TGAGTGGGATCTGGTGTGTGAGCAGAMGGTCTGMCAGAGCTGCGTCCACTTTCrTCTT 

421 + + + + -+ + 480 

ACTCACCCTAGACCACACACTCGTCTTTCCAGACrTGTCTCGACGCAGGTGAAAGAAGAA 

EWDLVCEQfCGLNRAASTFFF- 

CGCCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTTTGGGCGGCG 

481 + + + + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC 

AGVLVGAVAFGy lsdrfgrr- 
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GCGTCTGCTGCTGGTAGCCTACGTGAGTACCCTGGTGCTGGGCCTGGCATCTGCAGCCTC 
541 + ---+ + + + + 600 

CGCAGACGACGACCATCGGATGCACTCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

RLLLVAYVSTLVLGL ASAAS - 

CGTCAGCTATGTAATGTTTGCCATCACCCGCACCCTTACTGGCTCAGCCCTGGCTGGTTT 

601 + + + + + + 660 

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGACCAAA 

VSYVMFAITRTLTGSALAGF- 

TACCATCATCGTGATGCCACTGGAGCTGGAGTGGCTGGATGTGGAGCACCGCACCGTGGC 
661 + + + + + — + 720 

ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 
TI IVMPLELEWLDVEHRTVA- 

TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 

721 + ---+ + + --+ + 780 

ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACCGTGACCAACCCAT 

GVLSSTFWTGGVMLLAL VG Y - 

CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 

781 + + + + + + 840 

GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

LIRDWRWLLLAVTLPCAPSI - 

CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTGGCTTCTGACCCAAGGCCATGTGAA 
841 + + + + --+ + 900 

GGAGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

LSLWWVPESARWLLTQGHVK- 

AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA 
901 +-- + + + + + 960 

TCTCCGGGTGTCCATGAACGAGGTGACACGGrCCGAGTTACCCGCCGGTCACACACTCCT 

EAHRYLLHCARLNGRPVCED- 

CAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTCCGAAGACC 
961 + + + + + + 1020 

GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTTGCCCACCAGGCTTCTGG 
SFSQEAVSKVAAGERVVRRP- 

TTCATACCTAGACCTGTTCCGCACACCACGGCTCCGACACATCTCACTGTGCTGCGrGGT 
1021 + + + + + + 1080 

AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTGTAGAGTGACACGACGCACCA 
SYLDL FRTPRLRHISLCCV V- 
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GGTGTGGTTCGGA6TGAACTTCTCCTATTACGGCCTGAGTCTGGATGTGTCGG6GCTGGG 

1081 + + + + + --+ 1140 

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG- 

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGMCTGCCCTCCAAGCTGCT 

1141 -+ + --+ + -+ + 1200 

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTrCGACGA 

LNVYQTQLLFGAVELPSKLL- 

GGTCTACTTGrCGGTGCGCTACGCAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG 

1201 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYLSVRYAGRRLTQAGTLLG- 

CACGGCCCTGGCGTTCGGCACrAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC 

1261 — + — + +- — + + + 1320 

GTGCCGGGACCGCAAGCCGTGATCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG 

TALAFGTRLLVSSDMKSWST- 

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCACCACTGCCTACCTGTT 

1321 + -+ * + + + 1380 

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGAAGTGGTGACGGATGGACAA 

VLAVMGKAFSEAAFTTAYLF- 

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT 

1381 + + + +--- + --+ 1440 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

TSELYPTV LRQTGMGLTAL V - 

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC 
1441 - + + + + + + 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 
GRLGGSLAPLAALLDGVWLS- 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

1501 + + + + + + 1560 

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

LPKLTYG G I ALLAAGTALLL - 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGTGC 

1561 + +--- + + + + 1620 

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGG'AGGTCCTGCACCTCTCTnCTCACG 

PETRQAQLPETIODVERKSA- 
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CCCAACCAGTCTTCA6GAGGAAGA6ATGCCCATGAAGCAGGTCCAGAACTAAGTGGGAGT 

1621 +-- + * + + + 1680 

GGGTTGGTCAGAAGTCCTCCTTCTCTACGGGTACTTCGTCCAGGTCTTGATTCACCCTCA 

PTSLQEEEMPMKQVQN* 

GGAGGCAGGCCCTCCACAGAAGCTCTGCAGCAGGGGCTGGGAGAGCAGAAGGGCAGGCCC 

1681 +--- + + + + + 1740 

CCTCCGTCCGGGAGGTGTCTTCGAGACGTCGTCCCCGACCCTCTCGTCTTCCCGTCCGGG 

TGCAACTCAGGCTGGGAGTATCGAACCCTCTGCCTAGGGCCGGAGTTGCTGCCAGTACCC 

1741 + + + + + + 1800 

ACGTTGAGTCCGACCCTCATAGCTTGGGAGACGGATCCCGGCCTCAACGACGGTCATGGG 

GCTCCCTCTGCTCATCCATCCTTGATTATTTGGCTTCTAGGAACAGTTGACTTCCCAGAA 

1801 + + + + + + 1860 

CGAGGGAGACGAGTAGGTAGGAACTAATAAACCGAAGATCCTTGTCAACTGAAGGGTCTT 

TGCAGTGGGCTGCTGGGCACCCCTCTCACGGTTGGGGAGGATTCTGTAAATAAAGGTGCC 

1861 + +- + + + + 1920 

ACGTCACCCGACGACCCGTGGGGAGAGTGCCAACCCCTCCTAAGACATTTATTTCCACGG 

CCTTGGGTTGGGGCAATGGTGACGAGCTGTGGGAAGAGCCCTGGATAGGAAGCCACTGAG 

1921 + + +-- -+ --+ + 1980 

GGAACCCAACCCCGTTACCACTGCTCGACACCCTTCTCGGGACCTATCCTTCGGTGACTC 

TCTGCCCTGGGCTCTGATAAAACCTTCACCATTAACTTGCTGTGTGACCTTGGGCATGTG 
1981 + + + + + + 2040 

AGACGGGACCCGAGACTATTTTGGAAGTGGTAATTGAACGACACACTGGAACCCGTACAC 

GCTTTCCCTCTCTGGCCTCAGTCTGTTCATCTCCCAAATGGATAATGAAGCCTCTTGGGA 

2041 + + + + + + 2100 

CGAAAGGGAGAGACCGGAGTCAGACAAGTAGAGGGTTTACCTATTACTTCGGAGAACCCT 

GGCCCTACCATAGGATCTGTTGCCATGCTCAAATGAGTTACTGAATAAGGTGCTTCTGCT 

2101 + + + + + + 2160 

CCGGGATGGTATCCTAGACAACGGTACGAGTTTACTCAATGACTTATTCCACGAAGACGA 

TCTTCTAGAGATGGTGCTAAAGAAAGGACTAGCATATGAGACTTCrGGTACCAATGGGGC 
2161 + + + + + + 2220 

AGAAGATCTCTACCACGATTTCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCG 

TGGTGGGCATGCTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAA 

2221 + + + + + + 2280 

ACCACCCGTACGACAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTT 

AAAGAACAGAGCTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGG 
2281 + + + + + + 2340 

TTTCTTGTCTCGAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCC 

CAATGCATGTCATCCCAACCCCCACACTCCCCATCCTCCAACCCACTGGTCTCATGCCCA 

2341 + + + + + + 2400 

GTTACGTACAGTAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGT 
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AAGAAGA6TTGAAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATT 

2401 + + + + + + 2460 

TTCTTCTCAACTTCCGTACCCTCGGTTGTAAAATAACTTCTTCGGTGTCTCCGACTTTAA 

CAATAAACACAAGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

2461 + + --+ +- 2501 

GTTATTTGTGTTCAAAATACTCA 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



FIG. 2E 
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CTGCACCTGAAGCATTT6GT66GTGAGCAGCATGGGCTTTGAGGAGCTGCT6GAGCAGGT 

1 + + + + + — + 60 

GACGTGGACTTCGTAAACCACCCACTCGTCGTACCCGAAACTCCTCGACGACCTCGTCCA 

MGFEELLEQV- 

GGGCGGCrTTGGGCCCTTCCAACTGCGGAATGTGGCACTGCTGGCCCTGCCCCGAGTGCT 

61 + + + + + + 120 

CCCGCCGAAACCCGGGAAGGTTGACGCCTTACACCGTGACGACCGGGACGGGGCTCACGA 

GGFG P FQ L RNVALLAL P RV L - 

GCTACCACTGCACTTCCTCCTGCCCATCTTCCTGGCTGCCGTGCCTGCCCACCGATGTGC 

121 + + + + + + 180 

CGATGGTGACGTGAAGGAGGACGGGTAGAAGGACCGACGGCACGGACGGGTGGCTACACG 

LPLHFILP I FLAAVPAHRCA- 

CCTGCCGGGTGCCCCTGCCAACTTCAGCCATCAGGATGTGTGGCTGGAGGCCCATCTTCC 

181 + + + + + + 240 

GGACGGCCCACGGGGACGGTTGAAGTCGGTAGTCCTACACACCGACCTCCGGGTAGAAGG 

LPGAPAN FSHOD V W L E A H L P - 

CCGGGAGCCTGATGGCACGCTCAGCTCCTGCCTCCGCTTTGCCTATCCCCAGGCTCTCCC 

241 + + + + + + 300 

GGCCCTCGGACTACCGTGCGAGTCGAGGACGGAGGCGAAACGGATAGGGGTCCGAGAGGG 

REPDGTLSSCLRFAYPQALP- 

CAACACCACGTTGGGGGAAGAAAGGCAGAGCCGTGGGGAGCTGGAGGATGAACCTGCCAC 

301 + +- + + +-- --+ 360 

GTTGTGGTGCAACCCCCTTCTTTCCGTCTCGGCACCCCTCGACCTCCTACTrGGACGGTG 

NTTLGEERQSRGELEDEPAT- 

AGTGCCCTGCTCTCAGGGCTGGGAGTACGACCACTCAGAATTCTCCTCTACCATTGCAAC 

361 + + + + + + 420 

TCACGGGACGAGAGTCCCGACCCTCATGCTGGTGAGTCTTAAGAGGAGATGGTAACGTTG 

VPCSOGWEYDHSEFSSTIAT- 

TGAGTGGGATCTGGTGTGTGAGCAGAMGGTCTGMCAGAGCTGCGTCCACTTTCTTCTT 

421 + + *- + + + 480 

ACrCACCCTAGACCACACACTCGTCTTTCCAGACTTGTCTCGACGCAGGTGAAAGAAGAA 

EWDLVCEOkGLNRAASTFFF- 

CGCCGGTGTGCTGGTGGGGGCTGTGGCCTTTGGATATCTGTCCGACAGGTrTGGGCGGCG 

481 +■ + + + + + 540 

GCGGCCACACGACCACCCCCGACACCGGAAACCTATAGACAGGCTGTCCAAACCCGCCGC . 

AGVLVGAVAFGYLSDRFGRR- 
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GC6TCTGCTGCTGGTAGCCTACGTGA6TACCCTGGTGCTGGGCCTGGCATCT6CAGCCTC 

541 + + + + + + 600 

CGCAGACGACGACCATCGGATGCACrCATGGGACCACGACCCGGACCGTAGACGTCGGAG 

RLLLVAYVSTLVLGL ASAAS - 

CGTCAGCTATGTMTGTTTGCCATCACCCGCACCCTTACTGGCTCAGCCCTGGCTGGTTT 

501 + + +- + + + 660 

GCAGTCGATACATTACAAACGGTAGTGGGCGTGGGAATGACCGAGTCGGGACCGACCAAA 

VS YVMFAITRTLTGSALAGF- 

TACCATCATCGTGATGCCACTGGAGCTGGAGTGGCTGGATGTGGAGCACCGCACCGTGGC 

661 + + -+ + + + 720 

ATGGTAGTAGCACTACGGTGACCTCGACCTCACCGACCTACACCTCGTGGCGTGGCACCG 

TI I VMPLELEWLDVEHRTVA- 

TGGAGTCCTGAGCAGCACCTTCTGGACAGGGGGCGTGATGCTGCTGGCACTGGTTGGGTA 

721 - + + + + + + 780 

ACCTCAGGACTCGTCGTGGAAGACCTGTCCCCCGCACTACGACGACCGTGACCAACCCAT 

GVLSSTFWTGGVM- LLALVGY - 

CCTGATACGGGACTGGCGATGGCTTCTGCTAGCTGTCACCCTGCCTTGTGCCCCAAGCAT 

781 + + + — + + + 840 

GGACTATGCCCTGACCGCTACCGAAGACGATCGACAGTGGGACGGAACACGGGGTTCGTA 

LIRDWRWLLLAVTLPCAPSI - 

CCTCAGCCTCTGGTGGGTGCCTGAGTCTGCACGCTGGCTTCTGACCCAAGGCCATGTGM 

841 + ---+ + + + + 900 

GGAGTCGGAGACCACCCACGGACTCAGACGTGCGACCGAAGACTGGGTTCCGGTACACTT 

LSLWWVPESARWLLTQGHVK- 

AGAGGCCCACAGGTACTTGCTCCACTGTGCCAGGCTCAATGGGCGGCCAGTGTGTGAGGA 

901 + + + + + + 960 

TCTCCGGGTGTCCATGAACGAGGTGACACGGTCCGAGTTACCCGCCGGTCACACACTCCT 

E AHRYLLHCARLNGRPVCED- 

CAGCTTCAGCCAGGAGGCTGTGAGCAAAGTGGCCGCCGGGGAACGGGTGGTCCGAAGACC 

q61 + + + + + 1020 

GTCGAAGTCGGTCCTCCGACACTCGTTTCACCGGCGGCCCCTTGCCCACCAGGCTTCTGG 

SFSQEAVSKVAAG.ERVVRRP- 

TTCATACCTAGACC TG TTCCGCACACCACGGCTCCGACACATCTCAC TGTGCTGCGTGGT 

1021 + + -+ + + + 1080 

AAGTATGGATCTGGACAAGGCGTGTGGTGCCGAGGCTGTGTAGAGTGACACGACGCACCA 

SYLDLFRTPRLRH I SLCCVV- 
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GGTGTGGTTCGGAGTGAACTTCTCCTATrACGGCCTGAGTCTGGATGTGTCGGGGCTGGG 

1081 + + + + + + 1140 

CCACACCAAGCCTCACTTGAAGAGGATAATGCCGGACTCAGACCTACACAGCCCCGACCC 

VWFGVNFSYYGLSLDVSGLG- 

GCTGAACGTGTACCAGACACAGCTGTTGTTCGGGGCTGTGGMCTGCCCTCCAAGCTGCT 

1141 + ----+ + + + + 1200 

CGACTTGCACATGGTCTGTGTCGACAACAAGCCCCGACACCTTGACGGGAGGTTCGACGA 

LNVYQTQLLFGAVELPSKLL- 

GGTCTACTTGTCGGTGCGCTACGCAGGACGCCGCCTCACGCAAGCCGGGACACTGCTGGG 

1201 + + + + + + 1260 

CCAGATGAACAGCCACGCGATGCGTCCTGCGGCGGAGTGCGTTCGGCCCTGTGACGACCC 

VYLSVRYAGRRLTQAGTLLG- 

CACGGCCCTGGCGTTCGGCACTAGACTGCTAGTGTCCTCCGATATGAAGTCCTGGAGCAC 

1261 + + + + + + 1320 

GTGCCGGGACCGCAAGCCGTGAfCTGACGATCACAGGAGGCTATACTTCAGGACCTCGTG 

TALAFGTRLLVSS. DMKSWST- 

TGTCCTGGCAGTGATGGGGAAAGCTTTTTCTGAAGCTGCCTTCACCACTGCCTACCTGTT 

1321 --- + + + + + + 1380 

ACAGGACCGTCACTACCCCTTTCGAAAAAGACTTCGACGGMGTGGTGACGGATGGACAA 

VLAVMGKAFSEAAFTTAYLF- 

CACTTCAGAGTTGTACCCTACGGTGCTCAGACAGACAGGGATGGGGCTGACTGCACTGGT 

1381 + + — + + + + 1440 

GTGAAGTCTCAACATGGGATGCCACGAGTCTGTCTGTCCCTACCCCGACTGACGTGACCA 

TSELYPTVLRQTGMGLTALV- 

GGGCCGGCTGGGGGGCTCTTTGGCCCCACTGGCGGCCTTGCTGGATGGAGTGTGGCTGTC 

1441 + + + + + -+ 1500 

CCCGGCCGACCCCCCGAGAAACCGGGGTGACCGCCGGAACGACCTACCTCACACCGACAG 

GRLGGSLAPLAALLDGVWL S- 

ACTGCCCAAGCTTACTTATGGGGGGATCGCCCTGCTGGCTGCCGGCACCGCCCTCCTGCT 

1501 + + + + + 1560 

TGACGGGTTCGAATGAATACCCCCCTAGCGGGACGACCGACGGCCGTGGCGGGAGGACGA 

LPKLTYGGIALLAAGTALLL- 

GCCAGAGACGAGGCAGGCACAGCTGCCAGAGACCATCCAGGACGTGGAGAGAAAGAGAGA 

1561 + +--- + + + + 1620 

CGGTCTCTGCTCCGTCCGTGTCGACGGTCTCTGGTAGGTCCTGCACCTCTCTTTCTCTCT 

P E TRQAQLPET1QDVERK R D - 
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TGGT6CTAAAGAAAGGACTAGCATATGAGACTTCTGGTACCAATGGGGCTGGTGGGCATG 

2521 + + + + + — + 1680 

ACCACGATTrCTTTCCTGATCGTATACTCTGAAGACCATGGTTACCCCGACCACCCGTAC 



GAKERTS I 



* 



CTGTCCACTGTGTGGTGCTAGGACTGCCAATGCCAGGCCCAAGGGACAAAAAGAACAGAG 

X681 + + + + --+-- + 1740 

GACAGGTGACACACCACGATCCTGACGGTTACGGTCCGGGTTCCCTGTTTTTCTTGTCTC 

CTTTTTGTTCTCATGGCTGGCCCTGCTACCTCCGAGGCACCCTGCAGGGCAATGCATGTC 

1741 + + +-- -+ + + 1800 

GAAAAACAAGAGTACCGACCGGGACGATGGAGGCTCCGTGGGACGTCCCGTTACGTACAG 

ATCCCAACCCCCACACTCCCCATCCTCCAACCCACTGGTCTCATGCCCAAAGAAGAGTTG 

1801 + + + + -+- + 1860 

TAGGGTTGGGGGTGTGAGGGGTAGGAGGTTGGGTGACCAGAGTACGGGTTTCTTCTCAAC 

AAGGCATGGGAGCCAACATTTTATTGAAGAAGCCACAGAGGCTGAAATrCAATAAACACA 

1861 + + + + + + 1920 

TTCCGTACCCTCGGTTGTAAMTMCTTCTTCGGTGTCTCCGACTTTAAGTTATTTGTGT 

AGTTTTATGAGTAAAAAAAAAAAAAAAAAA 

1921 --- + + + 1980 

TCAAAATACTCAI 1 1 1 1 1 1 1 1 1 Mill III 
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GCAGGGACCTCAACTACACTGATCACCAGCCCCATCGGATCCAGACCCGGCCACCAGTGC 
I + + + + h + 60 

CGTCCCTGGAGTTGATGTGACTAGTGGTCGGGGTAGCCTAGGTCTGGGCCGGTGGTCACG 

C ATG ACC T rCTCGGAGATCCTGGACCGTGTGGGAAGCATGGGCCAT rTCCAGTTCCTGCA 

61 + + + + +-- + 120 

GTACTGGAAGAGCCTCTAGGACCTGGCACACCCTTCGTACCCGGTAAAGGTCAAGGACGT 

MTFSEILDRVGSMGHFQFLH - 

TGTAGCCATACTGGGCCTCCCGATCCTCAACATGGCCAACCACAACCTGCTGCAGATCTT 

121 + + + + + ---+ 180 

ACATCGGTATGACCCGGAGGGCTAGGAGTTGTACCGGTTGGTGTTGGACGACGTCTAGAA 

CACAGCCGCCACCCCTGTCCACCACTGTCGCCCGCCCCACAATGCCTCCACAGGGCCTTG 

181 + + + + +- ---+ 240 

GTGTCGGCGGTGGGGACAGGTGGTGACAGCGGGCGGGGTGTTACGGAGGTGTCCCGGAAC 

GGTGCTCCCCATGGGCCCAAATGGGAAGCCTGAGAGGTGCCTCCGTTTTGTACATCCGCC 

241 + + + + + + 300 

CCACGAGGGGTACCCGGGTTTACCCTTCGGACTCTCCACGGAGGCAAAACATGTAGGCGG 

CAATGCCAGCCTGCCCAATGACACCCAGAGGGCCATGGAGCCATGCCTGGATGGCTGGGT 

301 + + + + + + 360 

GTTACGGTCGGACGGGTTACTGTGGGTCTCCCGGTACCTCGGTACGGACCTACCGACCCA 

CTACAACAGCACCAAGGACTCCATTGTGACAGAGTGGGACTTGGTGTGCAACTCCAACAA 

361 --- + + + + + + 420 

GATGTTGTCGTGGTTCCTGAGGTAACACTGTCTCACCCTGAACCACACGTTGAGGTTGTT 

ACTGAAGGAGATGGCCCAGTCTATCTTCATGGCAGGTATACTGATTGGAGGGCTCGTGCT 

421 + + + +--- + + 480 

TGACTTCCTCTACCGGGTCAGATAGAAGTACCGTCCATATGACTAACCTCCCGAGCACGA 

TGGAGACCTGTCTGACAGGTTTGGCCGCAGGCCCATCCTGACCTGCAGCTACCTGCTGCT 

481 + + + + +- + 540 

ACCTCTGGACAGACTGTCCAAACCGGCGTCCGGGTAGGACTGGACGTCGATGGACGACGA 

GGCAGCCAGCGGCTCCGGTGCAGCCTTCAGCCCCACCTTCCCCATCTACATGGTCTTCCG 
541 + + *- + + + 600 
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CCGTCGGTC6CCGAGGCCACGTCGGAAGTCGGGGTGGAAGGGGTAGATGTACCA6AAGGC 
AASGSGAAFSPTF PIYMVFR- 

CTTCCTGTGTGGCTTTGGCATCTCAGGCATTACCCTGAGCACCGTCATCTTGAATGTGGA 

gni + + + + + + 660 

GAAGGACACACCGAAACCGTAGAGTCCGTAATGGGACTCGTGGCAGTAGAACTTACACCT 

FLCGFGISGITLSTVILNVE - 

ATGGGTGCCTACCCGGATGCGGGCCATCATGTCGACAGCACTCGGGTACTGCTACACCTT 

cgi + + + + + + 720 

TACCCACGGATGGGCCTACGCCCGGTAGTACAGCTGTCGTGAGCCCATGACGATGTGGAA 

WVPTRM RAIMSTALGYCYTF - 

TGGCCAGTTCATTCTGCCCGGCCTGGCCTACGCCATCCCCCAGTGGCGTTGGCTGCAGTT 

721 +-- + + + — + + 780 

ACCGGTCAAGTAAGACGGGCCGGACCGGATGCGGTAGGGGGTCACCGCAACCGACGTCAA 

GQF I LPGLAYAI PQWRWLQL - 

AACTGTGTCCATTCCCTTCTTCGTCTTCTTCCTATCATCCTGGTGGACACCAGAGTCCAT 

■jqI __ + _+ + + + + 840 

TTGACACAGGTAAGGGAAGAAGCAGAAGAAGGATAGTAGGACCACCTGTGGTCTCAGGTA 

T VSIPFFVFFLSSWWTPESI- 

ACGCTGGTTGGTCTTGTCTGGAAAGTCCTCGAAGGCCCTGAAGATACTCCGGCGGGTGGC 

84i + + + + + + 900 

TGCGACCAACCAGAACAGACCTTTCAGGAGCTTCCGGGACTTCTATGAGGCCGCCCACCG 

RWLVLSGKSSKALKILRRVA- 

TGTCTTCAATGGCAAGAAGGAAGAGGGAGAAAGGCTCAGCTTGGAGGAGCTCAAACTCAA 

+ + + + — -+- --+ 960 

ACAGMGTTACCGTTCTTCCTTCTCCCTCTTTCCGAGTCGMCCTCCTCGAGTTTGAGTT 

VFNGKKEEGERLSLEELKLN- 

CCTGCAGAAGGAGATCTCCTTGGCCAAGGCCAAGTACACCGCAAGTGACCTGTTCCGGAT 

951 +_ + ¥ + -+ 1020 

GGACGTCTTCCTCTAGAGGMCCGGTTCCGGTTCATGTGGCGTTCACTGGACAAGGCCTA 

LQ KE I SLAKAKYTASDLFRI - 

ACCCATGCTGCGCCGCATGACCTTCTGTCTTTCCCTGGCCTGGT1TGCTACCGGTTTTGC 

1Q21 -_ __+ 4- + -+ + + 1080 

TGGGTACGACGCGGCGTACTGGAAGACAGAAAGGGACCGGACCAAACGATGGCCAAAACG 



1081 



PMLRRMTFCLSLAWFATGFA 
CTACTATAGTTTGGCTATGGGTGTGGAAGAATTTGGAGTCAACCTCTACATCCTCCAGAT 



A 4- 



+ + 1140 
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GATGATATCAAACCGATACCCACACCTTCTTAMCCTCAGTTGGAGATGTAGGAGGfCTA 
YYSLAMGVEEFGVNLY I LQI - 

CATCTTrGGTGGGGTCGATGTCCCAGCCAAGTTCATCACCATCCTCTCCTTAAGCTACCT 

1141 + + + + + + 1200 

GTAGAAACCACCCCAGCTACAGGGTCGGTTCAAGTAGTGGTAGGAGAGGAATTCGATGGA 

IFGGVDVPAKFITILSISYL- 

GGGCCGGCATACCACTCAGGCCGCTGCCCTGCTCCTGGCAGGAGGGGCCATC7TGGCTCT 

1201 -+ + + + + + 1260 

CCCGGCCGTATGGTGAGTCCGGCGACGGGACGAGGACCGTCCTCCCCGGTAGAACCGAGA 

GRHTTQAAALLLAGGAI LAL - 

CACCTTTGTGCCCTTGGACTTGCAGACCGTGAGGACAGTATTGGCTGTGTTTGGGAAGGG 

1261 + + + +-- + + 1320 

GTGGAAACACGGGAACCTGAACGTCTGGCACTCCTGTCATAACCGACACAAAGCCTTCCC 

FFVPLDLQTVRTVLAVFGKG- 

ATGCCTATCCAGCTCCTTCAGCTGCCTCTTCCTCTACACAAGTGAATTATACCCCACAGT 

1321 + + + + + + 1380 

TACGGATAGGTCGAGGAAGTCGACGGAGAAGGAGATGTGTTCACTTAATATGGGGTGTCA 

CLSSSFSCLFLYTSELYPTV- 

CATCAGGCAAACAGGTATGGGCGTAAGTAACCTGTGGACCCGCGTGGGAAGCATGGTGTC 

1381 + + + + + + 1440 

GTAGTCCGTTTGTCCATACCCGCATTCATTGGACACCTGGGCGCACCCTTCGTACCACAG 

I RQTGMGVSNLWTRVGSMVS- 

CCCGCTGGTGAAAATCACGGGTGAGGTACAGCCCTTCATCCCCAATATCATCTACGGGAT 

1441 +- + + + + + 1500 

GGGCGACCACTTTTAGTGCCCACTCCATGTCGGGAAGTAGGGGTTATAGTAGATGCCCTA 

PL VKITGEVQP FIPNIIY GI- 

CACCGCCCTCCTCGGGGGCAGTGCTGCCCTCTTCCTGCCTGAGACCCTGAATCAGCCCTT 

1501 + -+ + + + + 1560 

GTGGCGGGAGGAGCCCCCGTCACGACGGGAGAAGGACGGACTCTGGGACTTAGTCGGGAA 

TALLGGSAALFLPETLNQPL- 

GCCAGAGACTATCGAAGACCTGGAAAACTGGTCCCTGCGGGCAAAGAAGCCAAAGCAGGA 

1561 + + + + +-- 4- 1620 

CGGTCTCTGATAGCTTCTGGACCTTTTGACCAGGGACGCCCGTTTCTTCGGTTTCGTCCT 

PETIEDLENW5LRAKKPK0E- 

GCCAGAGGTGGAAAAGGCCTCCCAGAGGATCCCTCTACAGCCTCACGGACCAGGCCTGGG 
1621 + + + + + + 1680 
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CGGTCTCCACCTTTTCCGGAGGGTCTCCTAGGGAGATGTCGGAGTGCCTGGTCC6GACCC 
PEVEKASQRI PLOPHGPGLG- 

CTCCAGCTGAGGACAACGGAACCCCCTTTCCCTGCCCTCCAGAGACTGATCCTAGCCAGG 

1681 + + — + + +- + 1740 

GAGGTCGACTCCTGTTGCCTTGGGGGAAAGGGACGGGAGGTCTCTGACTAGGATCGGTCC 

S S * 

CACCTTAGGAGTATAGGGAGGCCCCATATAGGTCCATCCTCCTAGGATGAAGCCTTCTGA 

1741 -- + + — + + + + 1800 

GTGGAATCCTCATATCCCTCCGGGGTATATCCAGGTAGGAGGATCCTACTTCGGAAGACT 

GAGCTTGGTGAAGGTGTCTCCATCACCACCACCAGAGCCTCCTGCCCAGCCCTGGCCAGT 

1801 + +- + + — + + 1860 

CTCGAACCACTTCCACAGAGGTAGTGGTGGTGGTCTCGGAGGACGGGTCGGGACCGGTCA 

TCAAAGGTTCAAGCCATCCCTGCCCTTGTTCTCCCTGCAACCCAAGCCCTGCCATTCTTC 

1861 + + *■ + ---+ + 1920 

AGTTTCCAAGTTCGGTAGGGACGGGAACAAGAGGGACGTTGGGTTCGGGACGGTAAGAAG 

TGTCTAGCCCTTCCCCACTGGCCAACTTCCCCCACTGTCCCGGTCCTCTTCCCCTGAGGT 

1921 + + + + + 1980 

ACAGATCGGGAAGGGGTGACCGGTTGAAGGGGGTGACAGGGCCAGGAGAAGGGGACTCCA 

CCCCTGATATCCCCTGGCTCAGTCCTAACAAGACTGAGTCTTAACAAGATGAGAAGTCCT 

1981 + + + + + + 2040 

GGGGACTATAGGGGACCGAGTCAGGATTGTTCTGACTCAGAATTGTTCTACTC7TCAGGA 

CCCCTTCTTGCCTCCCACACTTTTCTT rGATGGGAGGTTTCAATAAACAGCG ATAAGAAC 

2041 + + — + + + + 2100 

GGGGAAGAACGGAGGGTGTGAAAAGAAACTACCCTCCAAAGTTATTTGTCGCTATTCTTG 

TCTAAAAAAAAAAAAAAAAAA 

2101 + +- 2121 

AGATTTI II llllllllllll 
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CAAATTATTTCTTAC6TGACTTTAGAGAAAACGGCTACCTATCT6ACCCCAAAACGACTT 
1 _ -+ + + -+ -+ 60 

GTTTAATAAAGAATGCACTGAAATCTCTTTTGCCGATGGATAGACTGGGGTTTTGCTGAA 

GAGGAAACTGTTTCCACGGTCCTGCTGCAGGGGGGAAGCACAGTCGTCAAGAAGAGAGTG 
61 +■ ---+-.- + + + -+ 120 

CTCCTTTGACAAAGGTGCCAGGACGACGTCCCCCCTTCGTGTCAGCAGTTCTTCTCTCAC 

GGGTCAGGATCAAMCACATTTAGTGTGACTTAGGGAMGAAMCATTTTCCCTCTTTGA 

121 + + + + + + 180 

CCCAGTCCTAGTTTTGTGTAMTCACACTGMTCCCTTTCTTTTGTAAAAGGGAGAAACT 

ACCTCTCTGGATACAGTCATTTTGCCTCTACTTGAGGATCAACTGTTCAACCTCAATGGC 

181 + +--- +--- + + + 240 

TGGAGAGACCTATGTCAGTAAAACGGAGATGAACTCCTAGTTGACAAGTTGGAGTTACCG 

M A - 

CTTTCAGGACCTCCTGGGTCACGCTGGTGACCTGTGGAGATTCCAGATCCTTCAGACTGT 

241 +--- + + + •---+ --+ 300 

GAAAGTCCTGGAGGACCCAGTGCGACCACTGGACACCTCTAAGGTCTAGGAAGTCTGACA 

FQD LLGHAGDLWRFQILQTV- 

TTTTCTCTCMTCTTTGCTGTTGCTACATACCTTCATTTTATGCTGGAGAACTTCACTGC 

301 + + + + + + 360 

AAAAGAGAGTTAGAAACGACAACGATGTATGGAAGTAAAATACGACCTCTrGAAGTGACG 

FLSIFAVATYLHFMLENFTA- 

ATTCATACCTGGCCATCGCTGCTGGGTCCACATCCTGGACAATGACACTGTCTCTGACAA 

361 + + + + + + 420 

TAAGTATGGACCGGTAGCGACGACCCAGGTGTAGGACCTGTTACTGTGACAGAGACTGTT 

FIPGHRCWVHILDNDTVSDN- 

TGACACTGGGGCCCTCAGCCAAGATGCACTCTTGAGAATCTCCATCCCACTGGACTCAAA 

421 — -+ + + + + + 480 

ACTGTGACCCCGGGAGTCGGTTCTACGTGAGAACTCTTAGAGGTAGGGTGACCTGAGTTT 

DTGALSQQALLRISI PLDSN- 

CATGAGGCCAGAGAAGTGTCGTCGCTTTGTTCATCCTCAGTGGCAGCTCCTTCACCTGAA 

481 + + + +-- + + 540 

GTACTCCGGTCTCTTCACAGCAGCGAAACAAGTAGGAGTCACCGTCGAGGAAGTGGACTT 

MRPEKCRRFV HPQWQLLHLM- 

TGGGACCTTCCCCAACACAAGTGACGCAGACATGGAGCCCTGTGTGGATGGCTGGGTGTA 
541 + + + + 600 

ACCCTGGAAGGGGTTGTGTTCACTGCGTCTGTACCTCGGGACACACCTACCGACCCACAT 
GTFPNTSDAOMEPCVDGWVY- 
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TGACAGAATCTCCTTCTCATCCACCATCGGTGACCTGAAGTGGGATCTGGTATGTGACTC 

50i +- --+ + + +-- + 660 

ACTGTCTTAGAGGAAGAGTAGGTGGTAGCCACTGGACTTCACCCTAGACCATACACTGAG 

DR I SFSSTIGDLKWDL VCDS- 

TCAATCACTGACTTCAGTGGCTAAATTTGTATTCATGGCTGGAATGATGTTGGGAGGCAT 

661 + + + + + + 720 

AGTTAGTGACTGAAGTCACCGATTTAAACATAAGTACCGACCTTACTACAACCCTCCGTA 

QSLTSVAKFVFMAGMMLGGI- 

CCAGGTTGCCATTGTTGGCACTTGTGCAGCGTTGGCTCCCACTTTCCTCATTTACTGCTC 

72i + + + + + + 780 

GGTCCAACGGTAACAACCGTGAACACGTCGCAACCGAGGGTGAAAGGAGTAAATGACGAG 

LGVHLSDRFGRSFVLRWCYL- 

CCAGGTTGCCATTGTTGGCACTTGTGCAGCGTTGGCTCCCACTTTCCTCATTTACTGCTC 

781 --+ + + + "-+ + 840 

GGTCCAACGGTAACAACCGTGAACACGTCGCAACCGAGGGTGAAAGGAGTAAATGACGAG 

QVAIVGTCAALAPTFLIYCS- 

AGTACGCTTCTTGTCTGGGATTGCTGCAATGAGCTTCATAACAAATACTATTATGTTAAT 

841 _.._..+__ + + + — + + 900 

TCATGCGAAGAACAGACCCTAACGACGTTACTCGAAGTATTGTTTATGATAATACAATTA 

V RF LSG IAAMSF I TNTI ML I - 

AGCCGAGTGGGCAACACACAGATTCCAGGCCATGGGAATTACATTGGGAATGTGCCCTTC 

90i +-- — + + + + 960 

TCGGCTCACCCGTTGTGTGTCTAAGGTCCGGTACCCTTAATGTAACCCTTACACGGGAAG 

AE WATHRFQAMGITLGMCPS- 

TGGTATTGCATTTATGACCCTGGCAGGCCTGGCTTTTGCCATTCGAGACTGGCATATCCT 

961 + + + + + + 1020 

ACCATAACGTAAATACTGGGACCGTCCGGACCGAAAACGGTAAGCTCTGACCGTATAGGA 

GIAFMTLAGLAFA I RDWHIL- 

CCAGCTGGTGGTGTCTGTACCATACTTTGTGATCTTTCTGACCTCAAGTTGGCTGCTAGA 

1021 + + + + + + 1080 

GGTCGACCACCACAGACATGGTATGAAACACTAGAAAGACTGGAGTTCAACCGACGATCT 

QL. VVSVPYFV I FLTSSWLLE- 

GTCTGCTCGGTGGCTCATTATCAACAATAAACCAGAGGAAGGCTTAAAGGAACTTAGAAA 

1081 + +- -+-- -+ — +- 1140 

CAGACGAGCCACCGAGTMTAGTTGTTATTTGGTCTCCTTCCGMTTTCCTTGAATCTTT 

SARWL I INNKPEEGLKELRK- 
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AGC TGC AC ACAGGAGTGGAATGMGAATGCC AGAGACAC CCTAACCCTGG AGA ITTTGAA 

1141 --+ + + + + + 1200 

TCGACGTGTGTCCTCACCTTACTTCTTACGGTCTCTGTGGGATTGGGACCTCTAAAACTT 

AAHRSGMKNARDTLTLE I L K - 

ATCCACCATGAAAAAAGAACTGGAGGCAGCACAAAAAAAAAAACCTTCTCTGTGTGAAAT 

1201 ---- + + + + + + 1260 

TAGGTGGTACTTTTTTCTTGACCTCCGTCGTG 1 1 1 1 1 1 1 1 1 1 1 GGAAGAGACACACTTTA 

STMKKELEAAQKKKPSLCEM- 

GCTCCACATGCCCMCATATGTAAAAGGATCTCCCTCCTGTCCTTTACGAGATTTGCAAA 

1261 + + + + + + 1320 

CGAGGTGTACGGGTTGTATACATTTTCCTAGAGGGAGGACAGGAAATGCTCTAAACGTTT 

LHMPNICKRISLLSFTRFAN- 

CTTTATGGCCTATTTTGGCCTTMTCTCCATGrCCAGl^TCTGGGGMCMTGTTTTCCT 

1321 + -+ — + +-- + 1380 

GAAATACCGGATAAAACCGGAATTAGAGGTACAGGTCGTAGACCCCTTGTTACAAAAGGA 

FMAYFGLNLHVQH LGNNVFL- 

GTTGCAGACTCTCTTTGGTGCAGTCATCCTCCTGGCCAACTGTGTTGCACCTTGGGCACT 

1381 --+ + +■ + +-- -+ 1440 

CAACGTCTGAGAGAAACCACGTCAGTAGGAGGACCGGTTGACACAACGTGGAACCCGTGA 

LQTLFGAVILLANCVAPWAL- 

GAAATACATGAACCGTCGAGCAAGCCAGATGCTTCTCATGTTCCTACTGGCAATCTGCCT 

1441 + + + +-- + + 1500 

CTTTATGTACTTGGCAGCTCGTTCGGTCTACGAAGAGTACAAGGATGACCGTTAGACGGA 

KYMNRRASQMLLMFLLAICL- 

TCTGGCCATCATAmGTGCCACMGAMTGCAGACGCTGCGTGAGGTTTTGGCAACACT 

1501 +- + + + + + 1560 

AGACCGGTAGTATAMCACGGTGTTCTTTACGTCTGCGACGCACTCCAAAACCGTTGTGA 

LAI I FVPQEMQTLREVLATL- 

GGGCTTAGGAGCGTCTGCTCTTGCCAATACCCTTGCTTTTGCCCATGGAAATGAAGrAAT 

1561 + + + + + + 1620 

CCCGAATCCTCGCAGACGAGAACGGTTATGGGAACGAAAACGGGTACCTTTACTTCATTA 

GLGASALANTLAFAHGNEVI - 

TCCCACCATAATCAGGGCAAGAGCTATGGGGATCAATGCAACCTTTGCTAATATAGCAGG 
1621 + + + + + + 1680 

AGGGTGGTATTAGTCCCGTTCTCGATACCCCTAGTFACGTTGGAAACGATTATATCGTCC 
PTT I RARAMGINATFAN IAG- 
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AGCCCTGGCTCCCCTCATGATGATCCTAAGTGTGTATTCTCCACCCCTGCCCT6GATCAT 

1681 + + + + ---+ + 1740 

TCGGGACCGAGGGGAGTACTACfAGGATTCACACATAAGAGGTGGGGACGGGACCTAGTA 

ALAPLMMILSVYSPPlPWII- 

CTATGGAGTCTTCCCCTTCATCTCTGGCTTTGCTTTCCTCCTCCTTCCTGAAACCAGGAA 

1741 + + + +- --+ + 1800 

GATACCTCAGAAGGGGAAGTAGAGACCGAAACGAAAGGAGGAGGAAGGACTTTGGTCCTT 

YGVFPFISGFAFLLLPETRN- 

CAAGCCTCTGTTTGACACCATCCAGGATGAGAAAAATGAGAGAAAAGACCCCAGAGAACC 

1801 + + +-- + + + I860. 

GTTCGGAGACAMCTGTGGTAGGTCCTACTCTTTTTACTCTCTTTTCTGGGGTCTCTTGG 

KPLFDTIQDEKN ERKDPREP- 

AAAGCAAGAGGATCCGAGAGTGGAAGTGACGCAGTTTTAAGGAATTCCAGGAGCTGACTG 

1861 + + + + + + 1920 

TTTCGTTCTCCTAGGCTCTCACCTTCACTGCGTCAAAATFCCTTAAGGTCCTCGACTGAC 

KQEDP RVEVTQF* 

CCGATCAATGAGCCAGATGAAGGGAACMTCAGGACTATTCCTAGACACTAGCAAAA 

1Q21 + + + + + 1977 

GGCTAGTTACTCGGTCTACTTCCCTTGTTAGTCCTGATAAGGArCTGTGATCGTTTT 
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CTCCTGATAGCAAAAGAACTGAGGAAGCTCTTTCCACTACGGCTGTATTGCACrGGTGAG 
1 + 4- + + + + 60 

GAGGACTATCGTTTTCTTGACTCCTTCGAGAAAGGTGATGCCGACATAACGTGACCACTC 

TCCGGGCCCATGGATGAGAAATTGATGCGAGGATCAATACAAGCTTAATTTGAATTAATA 

61 + + + + + + 120 

AGGCCCGGGTACCTACTCTTTAACTACGCTCCTAGTTATGTTCGAATTAAACTrAATTAT 

AMGGAMTATTTTCTCCCTTTGAACTTATCTCCGTAAAGCCATTGTGCCTCCTCTTGGG 

121 --- + + + + + + 180 

TTTCCTTTATAAAAGAGGGAAACTTGAATAGAGGCATTTCGGTAACACGGAGGAGAACCC 

GGTCACGTGTTCACAATCAATGGCCTTTGAGGAGCTCTTGAGTCAAGTTGGAGGCCTTGG 

181 + + + + + + 240 

CCAGTGCACAAGTGTTAGTTACCGGAAACTCCTCGAGAACTCAGTTCAACCTCCGGAACC 

MAFEELLSQVGGLG- 

GAGATTTCAGATGCTTCATCTGGTTTTTATTCTTCCCTCTCTCATGTTATTAATCCCTCA 

241 + --+ + + +- + 300 

CTCTAAAGTCTACGAAGTAGACCAAAAATAAGMGGGAGAGAGTACAATAATTAGGGAGT 

RFQMLHLVFILPSLMLLIPH- 

TATACTGCTAGAGAACTTTGCTGCAGCCATTCCTGGTCATCGTTGCTGGGTCCACATGCT 

301 + + + + -+ + 360 

ATATGACGATCTCTTGAAACGACGTCGGTAAGGACCAGTAGCAACGACCCAGGTGTACGA 

ILLENFAAAIPGHRCWVHML- 

GGACAATAATACTGGATCTGGTAATGAAACTGGAATCCTCAGTGAAGATGCCCTCTTGAG 
361 + + + --+ + + 420 

CCTGTTATTATGACCTAGACCATTACTTTGACCTTAGGAGTCACTTCTACGGGAGAACTC 

DNNTGSGNETGILSEDALLR- 

AATCTCTATCCCACTAGACTCAAATCTGAGGCCAGAGAAGTGTCGTCGCTTTGTCCATCC 
4-2 A ~ - — _ _ _ „ _ _ -j- „ _ _ , _ _ ~ - - - - - _ _ ^30 

TTAGAGATAGGGTGATCTGAGTTTAGACTCCGGTCTCTTCACAGCAGCGAAACAGGTAGG 

ISIPLDSNLRPEKCRRFVHP- 

CCAGTGGCAGCTTCTTCACCTGAATGGGACTATCCACAGCACAAGTGAGGCAGACACAGA 

481 + + — + + + + 540 

GGTCACCGTCGAAGAAGTGGACTTACCCTGATAGGTGTCGTGTTCACTCCGTCTGTGTCT 

QWQLLHLNGTIHSTSEADTE- 

ACCCTGTGTGGATGGCTGGGTATATGATCAAAGCTACTTCCCTTCGACCATTGTGACTAA 

541 + + + + + + 600 

TGGGACACACCTACCGACCCATATACTAGTTTCGATGAAGGGAAGCTGGTAACACTGATT 

PCVDGWVV DOSYFPSTIVTK- 
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GTGGGACCTGGTATGTGATTATCAGTCACTGAAATCAGT6GTTCAATTCCTACTTCTGAC 

501 --+ + — +- + + + 660 

CACCCTGGACCATACACTAArAGTCAGTGACTTTAGTCACCAAGTTAAGGATGAAGACTG 

WDLVCDYQSLKSVVQF L L L T - 

TGGAATGCTGGTGGGAGGCATCATAGGTGGCCATGTCTCAGACAGGTTTGGGCGAAGATT 

551 + + + + + + 720 

ACCTTACGACCACCCTCCGTAGTATCCACCGGTACAGAGTCTGTCCAAACCCGCTTCTAA 

G M LVGGI IGGHVSDRFGRRF- 

TATTCTCAGATGGTGTTTGCTCCAGCTTGCCATTACTGACACCTGCGCTGCCTTCGCTCC 

721 + + -+-- + + + 780 

ATAAGAGTCTACCACAAACGAGGTCGAACGGTAATGACTGTGGACGACGCGGAAGCGAGG 

I LRWCLLQLAITDTCAAFAP- 

CACCTrCCCTGTTTACTGTGTACTACGCTTCTTGGCAGGTTTTTCTTCCATGATCATTAT 

7gi + + + --+ +-- + 840 

GTGGAAGGGACAAATGACACATGATGCGAAGAACCGTCCAAAAAGAAGGTACTAGTAATA 

TFPVYCVLRFLAGFSSMI I I - 

ATCAAATAATTCTTTGCCCATTACTGAGTGGATAAGGCCCAACTCTAAAGCCCTGGTAGT 

841 ___-+__. + + + +-- + 900 

TAGTTTATTAAGAAACGGGTAATGACTCACCTATTCCGGGTTGAGATTTCGGGACCATCA 

SNNSLPITEWI RP NSKALVV- 

AATATTGTCATCTGGTGCCCTTAGTATTGGACAGATAATCCTGGGAGGCTTGGCTTATGT 

901 + — -+ + + + + 960 

TTATAACAGTAGACCACGGGAATCATAACCTGTCTATTAGGACCCTCCGAACCGAATACA 

I L SSGALS IGQ I ILGGLAYV- 

CTTCCGAGACTGGCAMCCCTGCACGTGGTGGCGTCAGTACCTTTCCTTGGCCTCCTTCT 

961 + + ---+ + + + 1020 

GAAGGCTCTGACCGTTTGGGACGTGCACCACCGCAGTCATGGAAAGGAACCGGAGGAAGA 

FRDWQTLHVVASVPFLGLLL- 

CCTTCAAAGGTGGCTGGTGGAATCTGCTCGGTGGTTGATAATCACCAATAAACTAGATGA 

1021 + + + + + + 1080 

GGMGTTTCCACCGACCACCTTAGACGAGCCACCAACTATTAGTGGTTATTTGATCTACT 

LQRWLVESARWL I ITNKLDE- 

GGGCTTAAAGGCACTTAGAAAAGTTGCACGCACAAATGGAATAAAGAATGCTGAAGAAAC 

1081 + + + + + + 11^0 

CCCGMTTTCCGTGMTCTTTTCMCGTGCGTGTTTACCTTATTTCTTACGACrTCTTTG 

GLKALRKVARFNGIKNAEET- 
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CCT6AACATAGAG67TGTAAGATCCACCATGCAGGAGGAGCTGGATGCAGCACAGACCM 

iA4i + - + + 4- + + i?nn 

GGACTTGTATCTCCAACATTCTAGGTGGTACGTCCTCCTCGACCTACGTCGTGTCTGGTT 
LNIEVVRSTMQEELDAAQTK- 

1201 ' ^T^ 161610 ™ 0 ™™^ 

TTGATGACACACACTGMCMGGCGTTGGGGTCATACGCATm 1260 

TTVCDLFRNPSMRKRICILV- 
ATTTTTGAGATTrGCAAACACMTACCTT rTTATGGTACCATGGTCAATCTTCAGCATGT 

+ + + + __ + + n ™ 

TAAAAACTCTAAACGTTTGTGTTATGGAAAAATACCATGGTACCAGTTAGAAGTCGTACA 
FLRF ANTIPFYGTMVNLQHV- 
1321 GGGGAG ^ AmTCCTG ^ GCAGGTA CTTTATGG 

CCCCTCGTTGTAAAAGGACMCGTCCATC^ 1380 

GSNIFLLQVLYGA VALIVRC- 

1381 I^!T^T C ^ GACACTAAATCATAT6GGCCGTCGM ^ 

AGMCGAGAAAACTGTGATTTAGTATACCC^ 1440 

LALLTLNHMGRRISQILFMF- 
i a At GG ^^TGGGCCTTTCCATTTTGGCCMCACGTTTGTGCCCAAAGAAATGCAGACCCrGCG 
GGACCACCCGGAMGGTAAMCCGGTTGTGCAAACACGGG'TrrcrR 

LVGLSILANTFVPKEMQTLR - 
1501 T G T GGC ^ GGCATGTCTGGGAATCGGCTG ^CTGCTC^^ 

ACACCGAAACCGTACAGACCCTTAGCCGACM 1560 
VALACLG I GCSAATFSSVAV- 
1561 I CA ^II CA IJ GAACTCATC C^ 

AGTGMGTMCTTGAGTAGGGGTGACMGAGTCCCGTTCTCGAAGTCC 1 62 ° 

HFI ELIPTVLRARASGJDLT - 

1621 GGC I AG I AGGA ™ 

CCGATCATCCTMCCTCGTCGTGACCGAGGGGAGMCTA^ 1680 

ASRIGAALAPLLMTLTVFFT- 
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CACTTTGCCATGGATCAT T rATGGAATCTTCCCCATCATTGGTGGCCTTATTGTCTTCCT 

1681 + + + + + + 1740 

GTGAAACGGTACCTAGTAAATACCTTAGAAGGGGTAGTAACCACCGGAATAACAGAAGGA 

TLPWI IYGIFPI IGGLIVFL- 

CCTACCAGAAACCAAGAATCTGCCTTTGCCTGACACCATCAAGGATGTGGAAAATCAAAA 

1741 +-- + 4- + + + 1800 

GGATGGTCTTTGGTrCTTAGACGGAMCGGACTGTGGTAGTTCCTACACCTTTTAGTTTT 

LPETKNLPLPDTIKDVENQK- 

AAAAMTCTCMGGAAMGGCATAAAAATGATTGCTACACAAAAGTGACCAAATTTTAAG 

1801 + + + + + + 1860 

TTTTTTAGAGTTCCTTTTCCGTATTTTTACTMCGATGTGTTTTCACTGGTTrAAAATTC 

KNLKEKA* 

MGCCTTCATGAGCTGATrGGTGGGGAMTTCAGAAAAAAAAATACAGGAAAAGAACACA 

1861 -- + + + + + + 1920 

TTCGGAAGTACTCGACTAACCACCCCTTTAAGTC 1 1 1 1 1 1 1 1 1 ATGTCCTTTTCTTGTGT 



CCAGMGGGTrTTTTTCCCTACAACCAGCAAGAACATATATTAGATACATGAATCTCAAT 

1921 + + --+ + + + 1980 

GGTCTTCCCAAAAAAAGGGATGTTGGTCGTTCTTGTATATAATCTATGTACTTAGAGTTA 



TATMTTATGGCATTMTTTGCATTTTATTTCAAAATTAACTTGTGGGGACATGTAATCT' 

1981 + + + + + -+ 2040 

ATATTAATACCGTAATTAAACGTAAAATAAAGTTTTAATTGAACACCCaGTACATTAGA 



CTTGAGCMTCTGATATmTGGGMGTCCTTTAAAAAGTTACAAATTTATCAATAAATT 

2041 + + + + + + 2100 

GMCTCGTTAGACTATAAAMCCCTTCAGGAMTTTTrCMTGTTTAMTAGTTATTTAA 



ACTAGTAGATAAGATGATTCAGAAACAAAGGAAAATCACAGAATTAGGATGTGGCTGGCT 

2101 + + + + + + 2160 

TGATCATCTATTCTACTMGTCTTTGTTrCCTTTTAGrGTCTTAATCCTACACCGACCGA 



TGGTGTATGAAGCACCATGTGATGAATTCATAAAGTTGCAAAAGTCAAAACAATACTGTA 

2161 - + +• + +--- + + 2220 

ACCACATACTTCGTGGTACACTACTrMGTATTTCAACGTTTTCAGTTTTGTTATGACAT 

CATGCMCCAGAAATCAAATTAAATCCAGAAATAGAGACCTATATAAATGCATTTAATAC 

2221 + + + + + + 2280 

GTACGTTGGTCTTTAGTTTAATTTAGGTCTTrATCTCTGGATATATTTACGTAAATTATG 



ATGATACTTTTGACATATTAAGCCATTGGAAAACGGAAGGATTAGATACTTAAATAACAT 

2281 + + + + + + 2340 

TACTATGAAAACTGTATAATTCGGTAACCTTTTGCCTTCCTAATCTATGAATTTATrGTA 



TGCTATCTCTTTGTAAATACAGTCACTAAATGATGTTAGTTACTTTTCCATGGTGGAATT 
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TTMTTACTTTTTCTTTGTMTTTTTCTCTCTGTATATTTTAAACAAATA6CT6GTATAG 

2401 + + + +-- --+-- --+ 2460 

MTTMTGAAAMGAAACATTAAAAAGAGAGACATATAAMTTTGTTTATCGACCATATC 

TTTACMTATTATAMGATATTGTTCAAATTGMGGGCAMGGCCAGGTTCAGCMTTTT 

2461 + + + + ---+ + 2520 

AMTGTTATMTATTTCTATAACAAGTTrAACTTCCCGTTTCCGGTCCAAGTCGTTAAAA 

CAMCTGTATGTACATTTMTAAAATAACTATAAATTAAAAAATTATATTTCAAATGATG 

2521 + + + + + + 2580 

GTTTGACATACATGTAMTTATTTTATTGATATTTMTrTTrrMTATAMGTTTACTAC 

TGACTAATAAATGAAAGTACATATAGTAGTAAAGTAATTTCAGGCAAACCTATATAACCA 

2581 + + -+- + + + 2640 

ACTGATTATTTACTTTCATGTATATCATCATTrCATTAAAGTCCGTTTGGATATATTGGT 

AAATATAAACTTTCATTTTAAACAGCAAAAAAAAAAAAAAAAAA 

2641 + + +--- + 2684 

TTTATATTTGAAAGTAAAATTTGTCGTT 1 1 1 1 1 1 I I 1 1 1 I I ITT 



FIG. 6E 
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